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PATENT 

ATTORNEY DOCKET NO: 00786/368002 

MODIFICATION OF PLANT STORAGE RESERVES 

5 Background of the Invention 

This application claims benefit of U.S. provisional application 60/128,651, 
filed on April 8, 1999. 

This invention relates to the deposition of plant storage reserve materials (e.g., 
seed reserve material), the biogenesis of storage organelles, and the production of 
1 0 transgenic plants having altered storage reserve profiles. 

Flowering plants deposit extra food reserves in their seeds to support young 
seedling growth. The storage compounds in various seeds are principally composed of 
carbohydrates, proteins, and lipids. The relative amounts of these substances, however, 
differ widely among species (Bewley and Black, Seeds: Physiology of Development and 
15 Germination, Plenum Press, New York, 2 nd edition, Chap 1, 1994). For instance, cereal 
plants deposit relatively more carbohydrates, while legume seeds and oilseeds contain 
relatively more proteins and lipids, respectively (Bewley and Black, supra; Vitale and 
Bollini, In: Seed Development and Germination, J. Kigel and G. Galili, eds., Marcel 
Dekker, Inc., New York, 1995, pp. 73-102; Miquel and Browse, ibid, pp. 169-193). 
20 Although these variations are largely controlled by genetic factors, the molecular 
mechanisms that account for these different developmental programs for storage 
deposition are mostly unknown. 

Directly and indirectly (through feeding to livestock), seeds also make up the 
major sources of human diet. Therefore, improving the nutritive value of crop seeds has 
25 been of great interest. The efforts to date, however, have been made mainly through 

breeding (Payne, In: Seed Proteins, J. Daussant, J. Mosse, and J. Vaughan, eds. Academic 
Press, London, 1983, pp. 223-253; W. Gottschalk and H. P. Muller, eds, Seed Proteins: 
Biochemistry, Genetics, Nutritive Value, The Hague, The Netherlands, 1983). 



Summary of the Invention 
In general, the invention features an isolated nucleic acid molecule which 
includes a sequence encoding an SSE polypeptide. Preferably, the isolated nucleic acid 
molecule which includes a sequence encoding a polypeptide that is substantially identical 
5 to SSE 1 (SEQ ID NO:2). In other preferred embodiments, the sequence encodes an SSE 
polypeptide having at least 30% identity with the amino acid sequence shown in Fig. 2B 
(SEQ ID NO:2). In other preferred embodiments, the sequence encodes an SSE 
polypeptide that, when expressed in a cell of a plant, modifies or alters the production of 
a food storage reserve material (e.g., protein, lipid, or carbohydrate storage reserve); 
10 facilitates the intracellular transport of a storage protein; or facilitates the formation of 
protein or oil bodies. In still other preferred embodiments, the nucleic acid molecule is a 
cDNA molecule. 

In another aspect, the invention features an isolated nucleic acid molecule 
which includes a sequence that encodes an SSE polypeptide, wherein the isolated nucleic 

1 5 acid molecule hybridizes specifically to a nucleic acid molecule that includes the cDNA 
of Fig. 2A (SEQ ID NO:l). In preferred embodiments, the isolated nucleic acid sequence 
encodes an SSE polypeptide having at least 30% identity with the amino acid sequence 
shown in Fig. 2B (SEQ ID NO:2). 

In related aspects, the invention features a transgenic plant (or plant cell, plant 

20 tissue, plant organ, or plant component) which includes a recombinant transgene that 

expresses an SSE polypeptide, wherein the transgene is expressed in the transgenic plant 
under the control of an expression control region that is functional in a plant cell. The 
invention further features seeds and cells produced by a transgenic plant which includes 
such a recombinant transgene. 

25 In other related aspects, the invention features a sense-oriented expression 

vector which includes any of the aforementioned nucleic acid molecules; the vector being 
capable of directing expression of the SSE polypeptide encoded by the nucleic acid 
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molecule. In this regard, the invention also includes a cell (e.g., a bacterial or plant cell) 
or a transgenic plant or transgenic plant component that includes such an expression 
vector. 

In other aspects, the invention features an expression vector for producing 
5 antisense SSE RNA; a transgenic plant or transgenic plant component including such an 
antisense vector; and seeds or cells produced by a transgenic plant or transgenic plant 
component that express the antisense construct. 

In another aspect, the invention features a substantially pure SSE polypeptide 
that includes an amino acid sequence having at least 30% identity to the amino acid 
10 sequence of Fig. 2B (SEQ ID NO:2). In preferred embodiments, the polypeptide 
modifies or alters the production of a storage reserve (e.g., a protein or lipid storage 
reserve); facilitates the intracellular transport of a storage protein or lipid; or facilitates 
the formation of protein bodies or oil bodies. 

In another aspect, the invention features a method of producing an SSE 
15 polypeptide, the method includes the steps of: (a) providing a cell transformed with a 
nucleic acid molecule of the invention positioned for expression in the cell; (b) culturing 
the transformed cell under conditions for expressing the nucleic acid molecule; and (c) 
recovering the SSE polypeptide. Recombinant SSE polypeptides produced using this 
method are also included in the invention. 
20 In another aspect, the invention features a substantially pure antibody that 

specifically recognizes and binds to an SSE polypeptide or a portion thereof. In preferred 
embodiments, the antibody specifically recognizes and binds to a recombinant SSE 
polypeptide or a portion thereof. 

In another aspect, the invention features a method of isolating an SSE gene or 
25 fragment thereof, the method including the steps of: (a) contacting the nucleic acid 

molecule of Fig. 2 A (SEQ ID NO:l) or a portion thereof with a nucleic acid preparation 
from a plant cell under hybridization conditions providing detection of nucleic acid 
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sequences having at least 30% or greater sequence identity to the nucleic acid sequence of 
Fig. 2A (SEQ ID NO:l); and (b) isolating the hybridizing nucleic acid sequences. 

In still another aspect, the invention features a method of isolating an SSE 
gene or fragment thereof, the method including the steps of: (a) providing a sample of 
5 plant cell DNA; (b) providing a pair of oligonucleotides having sequence identity to a 
region of the nucleic acid of Fig. 2A (SEQ ID NO:l); (c) contacting the pair of 
oligonucleotides with the plant cell DNA under conditions suitable for polymerase chain 
reaction-mediated DNA amplification; and (d) isolating the amplified SSE gene or 
fragment thereof. In preferred embodiments, the amplification step is carried out using a 
10 sample of cDNA prepared from a plant cell. In still other preferred embodiments, the pair 
of oligonucleotides used in the amplification step are based on a sequence encoding an 
D SSE polypeptide, wherein the SSE polypeptide is at least 30% identical to the amino acid 

J sequence of Fig. 2B (SEQ ID NO:2). 

In another aspect, the invention features a method for modifying or altering 
y 15 the biosynthesis of a storage reserve in a transgenic plant or transgenic plant component, 
*g the method including the steps of: (a) introducing into a plant cell a transgene including 

5 DNA encoding an SSE polypeptide having at least 20% identity to the SSE1 polypeptide 

p x (SEQ ID NO: 2) operably linked to a promoter functional in plant cells to yield a 

C transformed plant cells; and (b) regenerating a transgenic plant or transgenic plant 

m ' 20 component from said transformed cells, wherein the SSE polypeptide is expressed in the 
cells of said transgenic plant or transgenic plant component, thereby modifying or 
altering the seed storage reserve of said transgenic plant or transgenic plant component. 
In preferred embodiments, the storage reserve material is a lipid, a storage protein, or a 
carbohydrate (e.g., a starch). In preferred embodiments, the expressed polypeptide is 
25 Pexl6 (SEQ ID NO:6). In other preferred embodiments, the storage reserve is a seed or 
vegetative storage reserve material. 

In another aspect, the invention features a method for modifying or altering 
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the biosynthesis of a storage reserve in a transgenic plant cell, the method including 
reducing the level of an SSE polypeptide (or expression of an SSE gene) in a transgenic 
plant or transgenic plant component. In preferred embodiments, the method for reducing 
the level of the SSE polypeptide includes expressing an antisense SSE nucleic acid 
5 sequence in the transgenic plant or transgenic plant component. In other preferred 
embodiments, the method for reducing the level of an SSE polypeptide includes co- 
suppression of an SSE nucleic acid sequence in the transgenic plant or transgenic plant 
component. Preferably, the storage reserve material is a lipid, a storage protein, or a 
carbohydrate (e.g., a starch). In yet other preferred embodiments, the storage reserve is a 

10 seed or vegetative storage reserve material. 

In yet another aspect, the invention features a process for modifying storage 
protein production in a transgenic plant or transgenic plant component, the method 
including the steps of: (a) introducing into plant cells a transgene encoding an SSE 
polypeptide operably linked to a promoter functional in the plant cells to yield 

1 5 transformed plant cells; and (b) regenerating a transgenic plant or transgenic plant 

component from the transformed plant cells, wherein the SSE polypeptide is expressed in 
the cells of the transgenic plant or transgenic plant component, thereby modifying storage 
protein production in the transgenic plant or transgenic plant component. In preferred 
embodiments, the transgene encoding the SSE polypeptide is overexpressed. In other 

20 preferred embodiments, the transgene encoding the SSE polypeptide is constitutively 
expressed, is inducibly expressed, or is expressed in a tissue-specific, cell-specific, or 
organ-specific manner. Preferably, storage protein production is increased relative to an 
untransformed control plant or plant component. 

In a related aspect, the invention also features a process for modifying storage 

25 protein production in a transgenic plant or transgenic plant component, the method 

including the steps of: (a) introducing into plant cells a transgene encoding an antisense 
coding sequence of an SSE polypeptide operably linked to a promoter functional in the 
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plant cells to yield transformed plant cells; and (b) regenerating a transgenic plant or 
transgenic plant component from the transformed plant cells, wherein the antisense 
coding sequence of the SSE polypeptide is expressed in the cells of the transgenic plant 
or transgenic plant component, thereby modifying storage protein production in the 

5 transgenic plant or transgenic plant component. In preferred embodiments, the transgene 
encoding an antisense coding sequence of an SSE polypeptide is overexpressed. In other 
preferred embodiments, the transgene encoding an antisense coding sequence of an SSE 
polypeptide is constitutively expressed. Preferably, the transgene encodes an antisense 
coding sequence of an SSE polypeptide is inducibly expressed or is expressed in a tissue- 

10 specific, cell-specific, or organ-specific manner. Preferably, storage protein production is 
decreased relative to an untransformed control plant or plant component. 

In another aspect, the invention features a process for modifying storage lipid 
production in a transgenic plant or transgenic plant component, the method including the 
steps of: (a) introducing into plant cells a transgene encoding an SSE polypeptide 

1 5 operably linked to a promoter functional in the plant cells to yield transformed plant cells; 
and (b) regenerating a transgenic plant or transgenic plant component from the 
transformed plant cells, wherein the SSE polypeptide is expressed in the cells of the 
transgenic plant or transgenic plant component, thereby modifying storage lipid 
production in the transgenic plant or transgenic plant component. In preferred 

20 embodiments, the transgene encoding the SSE polypeptide is overexpressed. In other 
preferred embodiments, the transgene encoding the SSE polypeptide is constitutively 
expressed, is inducibly expressed, or is expressed in a tissue-specific, cell-specific, or 
organ-specific manner. Preferably, storage lipid production is increased relative to an 
untransformed control plant or plant component. 

25 In another aspect, the invention features a process for modifying storage lipid 

production in a transgenic plant or transgenic plant component, the method including the 
steps of: (a) introducing into plant cells a transgene encoding an antisense coding 
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sequence of an SSE polypeptide operably linked to a promoter functional in the plant 
cells to yield transformed plant cells; and (b) regenerating a transgenic plant or transgenic 
plant component from the transformed plant cells, wherein the antisense coding sequence 
of the SSE polypeptide is expressed in the cells of the transgenic plant or transgenic plant 

5 component, thereby modifying storage lipid production in the transgenic plant or 
transgenic plant component. In preferred embodiments, the transgene encoding an 
antisense coding sequence of an SSE polypeptide is overexpressed. In other preferred 
embodiments, the transgene encoding an antisense coding sequence of an SSE 
polypeptide is constitutively expressed. Preferably, the transgene encodes an antisense 

10 coding sequence of an SSE polypeptide is inducibly expressed or is expressed in a tissue- 
specific, cell-specific, or organ-specific manner. Preferably, storage lipid production is 
decreased relative to an untransformed control plant or plant component. 

In another aspect, the invention features a process for modifying storage 
carbohydrate production in a transgenic plant or transgenic plant component, the method 

15 including the steps of: (a) introducing into plant cells a transgene encoding an SSE 
polypeptide operably linked to a promoter functional in the plant cells to yield 
transformed plant cells; and (b) regenerating a transgenic plant or transgenic plant 
component from the transformed plant cells, wherein the SSE polypeptide is expressed in 
the cells of the transgenic plant or transgenic plant component, thereby modifying storage 

20 carbohydrate production in the transgenic plant or transgenic plant component. In 

preferred embodiments, the transgene encoding the SSE polypeptide is overexpressed. In 
other preferred embodiments, the transgene encoding the SSE polypeptide is 
constitutively expressed, is inducibly expressed, or is expressed in a tissue-specific, cell- 
specific, or organ-specific manner. Preferably, storage carbohydrate production (e.g., 

25 starch production) is decreased relative to an untransformed control plant or plant 
component. 

In another aspect, the invention features a process for modifying storage 
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carbohydrate production in a transgenic plant or transgenic plant component, the method 
including the steps of: (a) introducing into plant cells a transgene encoding an antisense 
coding sequence of an SSE polypeptide operably linked to a promoter functional in the 
plant cells to yield transformed plant cells; and (b) regenerating a transgenic plant or 

5 transgenic plant component from the transformed plant cells, wherein the antisense 
coding sequence of the SSE polypeptide is expressed in the cells of the transgenic plant 
or transgenic plant component, thereby modifying storage carbohydrate production in the 
transgenic plant or transgenic plant component. In other preferred embodiments, the 
transgene encoding an antisense coding sequence of an SSE polypeptide is constitutively 

10 expressed. Preferably, the transgene encodes an antisense coding sequence of an SSE 
polypeptide is inducibly expressed or is expressed in a tissue-specific, cell-specific, or 
organ-specific manner. Preferably, storage carbohydrate production (e.g., starch 
production) is increased relative to an untransformed control plant or plant component. 

In another aspect, the invention features a process for modifying dessication 

1 5 tolerance of a transgenic plant or transgenic plant component, the method including the 
steps of: (a) introducing into plant cells a transgene encoding an antisense coding 
sequence of an SSE polypeptide operably linked to a promoter functional in the plant 
cells to yield transformed plant cells; and (b) regenerating a transgenic plant or transgenic 
plant component from the transformed plant cells, wherein the antisense coding sequence 

20 of the SSE polypeptide is expressed in the cells of the transgenic plant or transgenic plant 
component, thereby modifying dessication tolerance of the transgenic plant or transgenic 
plant component. In preferred embodiments, the dessication tolerance of the transgenic 
plant or transgenic plant component is increased relative to an untransformed control 
plant or plant component. 

25 By "shrunken seed" gene or "SSE" gene is meant a gene encoding a 

polypeptide that governs or regulates protein and oil body biogenesis in a plant cell. SSE 
genes may be identified and isolated from any plant species, especially agronomically 
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important crop plants, using any of the sequences disclosed herein in combination with 
conventional methods known in the art. 

By "polypeptide" is meant any chain of amino acids, regardless of length or 
post-translational modification (for example, glycosylation or phosphorylation). 
5 By "substantially identical" is meant a polypeptide or nucleic acid exhibiting 

at least 30%, preferably 50%, more preferably 80%, and most preferably 90%, or even 
95% homology to a reference amino acid sequence (for example, the amino acid 
sequence shown in Fig. 2B (SEQ ID NO:2) or nucleic acid sequence (for example, the 
nucleic acid sequences shown in Fig. 2A (SEQ ID NO:l)). For polypeptides, the length 
10 of comparison sequences will generally be at least 16 amino acids, preferably at least 20 
amino acids, more preferably at least 25 amino acids, and most preferably 35 amino acids 
y or greater. For nucleic acids, the length of comparison sequences will generally be at 

p least 50 nucleotides, preferably at least 60 nucleotides, more preferably at least 75 

Lj nucleotides, and most preferably 1 10 nucleotides or greater. 

4- 15 Sequence identity is typically measured using sequence analysis software (for 

example, Sequence Analysis Software Package of the Genetics Computer Group, 
% University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 

3 53705, BLAST, or PILEUP/PRETTYBOX programs). For example, such software when 

3 set to standard parameters matches identical or similar sequences by assigning degrees of 

20 homology to various substitutions, deletions, and/or other modifications. Conservative 
substitutions typically include substitutions within the following groups: glycine alanine; 
valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, 
threonine; lysine, arginine; and phenylalanine, tyrosine. 

By a "substantially pure polypeptide" is meant an SSE polypeptide (for 
25 example, an SSE polypeptide such as SSE1 (SEQ ED NO:2)) that has been separated from 
components which naturally accompany it. Typically, the polypeptide is substantially 
pure when it is at least 60%, by weight, free from the proteins and naturally-occurring 
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organic molecules with which it is naturally associated. Preferably, the preparation is at 
least 75%, more preferably at least 90%, and most preferably at least 99%, by weight, an 
SSE polypeptide. A substantially pure SSE polypeptide may be obtained, for example, 
by extraction from a natural source (for example, a plant cell); by expression of a 
5 recombinant nucleic acid encoding an SSE polypeptide; or by chemically synthesizing 
the protein. Purity can be measured by any appropriate method, for example, column 
chromatography, polyacrylamide gel electrophoresis, or by HPLC analysis. 

By "derived from" or "obtained from" is meant isolated from or having the 
sequence of a naturally-occurring sequence (e.g., a cDNA, genomic DNA, synthetic, or a 
1 0 combination thereof). 

By "isolated nucleic acid molecule" is meant a DNA molecule that is free of 
; tf the genes which, in the naturally-occurring genome of the organism from which the DNA 

j* of the invention is derived, flank the gene. The term therefore includes, for example, a 

;i| recombinant DNA that is incorporated into a vector; into an autonomously replicating 

'Nj 1 5 plasmid or virus; or into the genomic DNA of a prokaryote or eukaryote; or that exists as 
« a separate molecule (for example, a cDNA or a genomic or cDNA fragment produced by 

5 PCR or restriction endonuclease digestion) independent of other sequences. It also 

;=? includes a recombinant DNA which is part of a hybrid gene encoding additional 

O polypeptide sequence. 

20 By "hybridizes specifically" is meant that a nucleic acid sequence hybridizes 

to a DNA sequence at least under low stringency conditions as described herein, and 
preferably under high stringency conditions, also as described herein. 

By "antisense SSE" is meant a nucleotide sequence that is complementary to 
an SSE (or SSE homolog) messenger RNA. In general, such an antisense sequence will 
25 usually be at least 15 nucleotides, preferably about 15-200 nucleotides, and more 
preferably 200-2,000 nucleotides in length. The antisense sequence may be 
complementary to all or a portion of the SSE or SSE homolog mRNA nucleotide 
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sequence (for example, the SSE1 gene), and, as appreciated by those skilled in the art, the 
particular site or sites to which the antisense sequence binds as well as the length of the 
antisense sequence will vary, depending upon the degree of inhibition desired and the 
uniqueness of the antisense sequence. A transcriptional construct expressing an SSE 

5 antisense nucleotide sequence includes, in the direction of transcription, a promoter, the 
sequence coding for the antisense RNA on the sense strand, and a transcriptional 
termination region. Antisense SSE sequences may be constructed and expressed 
according to standard methods, for example, in van der Krol et al., Gene 72:45, 1988; 
Rodermel et al, Cell 55:673, 1988; Mol et al., FEBS Lett. 268:427, 1990; Weigel and 

10 Nilsson, Nature 377: 495, 1995; Cheung et al., Cell 82:383, 1995; and U.S. Pat. No. 
5,107,065. 

By "transformed cell" is meant a cell into which (or into an ancestor of which) 
has been introduced, by means of recombinant DNA techniques, a DNA molecule 
encoding (as used herein) an SSE polypeptide. 

1 5 By "positioned for expression" is meant that the DNA molecule is positioned 

adjacent to a DNA sequence which directs transcription and translation of the sequence 
(i.e., facilitates the production of, for example, an SSE polypeptide, a recombinant 
protein, or an RNA molecule). 

By "reporter gene" is meant a gene whose expression may be assayed; such 

20 genes include, without limitation, p-glucuronidase (GUS), luciferase, chloramphenicol 
transacetylase (CAT), green fluorescent protein (GFP), fi-galactosidase, herbicide 
resistant genes and antibiotic resistance genes. 

By "expression control region" is meant any minimal sequence sufficient to 
direct transcription. Included in the invention are promoter elements that are sufficient to 

25 render promoter-dependent gene expression controllable for cell-, tissue-, or organ- 
specific gene expression, or elements that are inducible by external signals or agents (for 
example, light-, pathogen-, wound-, stress-, or hormone-inducible elements or chemical 
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inducers); such elements may be located in the 5' or 3' regions of the native gene or 
engineered into a transgene construct. 

By "operably linked" is meant that a gene and a regulatory sequence(s) are 
connected in such a way as to permit gene expression when the appropriate molecules 
5 (for example, transcriptional activator proteins) are bound to the regulatory sequence(s). 
By "plant cell" is meant any self-propagating cell bounded by a semi- 
permeable membrane and containing a plastid. Such a cell also requires a cell wall if 
further propagation is desired. Plant cell, as used herein includes, without limitation, 
algae, cyanobacteria, seeds, suspension cultures, embryos, meristematic regions, callus 
10 tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores. 

By "crucifer" is meant any plant that is classified within the Cruciferae family. 
The Cruciferae include many agricultural crops, including, without limitation, rape (for 
example, Brassica campestris and Brassica napus\ broccoli, cabbage, brussel sprouts, 
radish, kale, Chinese kale, kohlrabi, cauliflower, turnip, rutabaga, mustard, horseradish, 
1 5 and Arabidopsis. 

By "transgene" is meant any piece of DNA which is inserted by artifice into a 
cell, and becomes part of the genome of the organism which develops from that cell. 
Such a transgene may include a gene which is partly or entirely heterologous (i.e., 
foreign) to the transgenic organism, or may represent a gene homologous to an 
20 endogenous gene of the organism. 

By "transgenic" is meant any cell which includes a DNA sequence which is 
inserted by artifice into a cell and becomes part of the genome of the organism which 
develops from that cell. As used herein, the transgenic organisms are generally 
transgenic plants and the DNA (transgene) is inserted by artifice into the nuclear or 
25 plastidic genome. A transgenic plant according to the invention may contain one or more 
acquired resistance genes. 

By "detectably-labelled" is meant any direct or indirect means for marking 
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and identifying the presence of a molecule, for example, an oligonucleotide probe or 
primer, a gene or fragment thereof, or a cDNA molecule or a fragment thereof. Methods 
for detectably-labelling a molecule are well known in the art and include, without 
limitation, radioactive labelling (for example, with an isotope such as 32 P or 35 S) and 

5 nonradioactive labelling (for example, chemiluminescent labelling, for example, 
fluorescein labelling). 

By "purified antibody" is meant antibody which is at least 60%, by weight, 
free from proteins and naturally-occurring organic molecules with which it is naturally 
associated. Preferably, the preparation is at least 75%, more preferably 90%, and most 

10 preferably at least 99%, by weight, antibody, for example, an acquired resistance 

polypeptide-specific antibody. A purified SSE antibody may be obtained, for example, 
by affinity chromatography using a recombinantly-produced acquired resistance 
polypeptide and standard techniques. 

By "specifically binds" is meant an antibody which recognizes and binds an 

15 SSE protein but which does not substantially recognize and bind other molecules in a 
sample, for example, a biological sample, which naturally includes an SSE protein such 
asSSEL 

As discussed above, a fundamental gene that is responsible for protein and oil 
body biogenesis has been identified. Accordingly, the invention provides a number of 

20 important advances and advantages for engineering plant storage reserves, including seed 
and vegetative reserve storage material. For example, by providing SSE genes as 
described herein that are readily incorporated and expressed in all species of plants, the 
invention facilitates an effective and economical means for producing plants having 
increased nutritional value. In addition, because plants expressing an SSE gene are 

25 dessication tolerant, the invention further provides for increased production efficiency, as 
well as for improvements in quality and yield of crop plants and ornamentals. Thus, the 
invention contributes to the production of high quality and high yield agricultural 
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products: for example, fruits, ornamentals, vegetables, legumes, cereals and field crops. 

The invention is also useful for providing nucleic acid and amino acid 
sequences of an SSE gene that facilitates the isolation and identification of SSE genes 
from any plant species. 
5 Other features and advantages of the invention will be apparent from the 

following description of the preferred embodiments thereof, and from the claims. 

Detailed Description 
The drawings will first be described. 

Drawings 

10 Figure 1 is a panel of photographs showing the abnormal storage deposition 

and the shrunken phenotype of ssel seeds. Figures 1 A and IB are transmission electron 
micrographs of a representative cell from the wild-type cotyledon and hypocotyl, 
respectively. Figures 1C and ID are transmission electron micrographs of a 
representative cell from the ssel cotyledon and hypocotyl, respectively. Wild-type cells 

15 are filled with numerous oil bodies (OB) and a few large protein bodies (PB). ssel cells 
contained few oil bodies and additional structures such as starch granules (St), vacuoles 
(Vc), stacks of membranes (M), and vesicles (Vs). Figures IE and IF show photographs 
of wild-type C24 and ssel seeds, respectively. The magnification bar found in Figs. 1 A- 
1D is 3.1 yM. For histological examination, mature wild-type C24 and once backcrossed 

20 ssel seeds were used, ssel seeds were imbibed in water for 20 minutes before 
processing. Seeds were cut into halves and fixed in 2.5% glutaraldehyde/0.1 M 
cacodylate buffer (pH 7.2), post-fixed in 1% osmium tetroxide, dehydrated in an ethanol 
series, and embedded in Spurr's resin. Thin sections were then stained with uranyl 
acetate and observed under a transmission electron microscope. 

25 Figure 2A shows the cDNA sequence (SEQ ID NO: 1) of SSE1 . The ATG 

start codon and TGA stop codon of the SSE1 gene are located at positions 122 and 1223, 
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respectively. 

Figure 2B shows the predicted amino acid sequence of SSE1 (SEQ ID NO:2) 
encoded by the cDNA shown in Fig. 2A. Hydrophobic (single line) and hydrophilic 
(double line) regions of the polypeptide are underlined. 
5 Figure 3A is a PCR diagram showing that primers A and B amplify an -0.9- 

kb fragment from the wild-type SSE1 allele, and primers C and B amplify an ~1.6-kb 
fragment from the T-DNA interrupted ssel allele. 

Figure 3B shows the single seed PCR results of round (R) and shrunken (S) 
seeds in a F 2 population derived from a backcross between a T2 and a wild-type plant. 
10 Figure 3C shows the representative single seed PCR results of round (R) 

(n=13) and shrunken (S) (n=21) T4 seeds. A control reaction from a wild-type (WT) seed 
*£? is also shown. 

]g Figure 4A shows the amino acid sequence alignment of SSE1 (SEQ ID NO:2) 

andPexl6p (SEQIDNO:6). Dots indicate gaps. Identical residues are boxed. 
I 1 5 Hydrophobic (single line) and hydrophilic (double line) regions for both proteins are 
underlined (Kyte and Doolittle, J. Mol. Biol. 157:105, 1982). The predicted 
glycosylation site of SSE1 is indicated with an asterisk. Single-letter abbreviations for 
O the amino acid residues are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, 

Q His; I, He; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gin; R, Arg; S, Ser; T, Thr; V, 

U 20 Val; W, Trp; and Y, Tyr. 

Figure 4B shows the phenotype of seeds obtained from ssel plants expressing 
the SSE1 transgene. 

Figure 4C shows the SSE1 complementation of pexl6 mutants pexl6-l and 
P16KO-8A (Eitzen et al., J. Cell Biol. 137:1265, 1997) for growth on oleic acid as sole 
25 carbon source. SSE1 cDNA was cloned into the EcoRI site of a Y. lipolytica shuttle 
vector pTc3 between the promoter and the terminator regions of Y. lipolytica thiolase 
gene. Ura + transformants of pex 16-1 and P16KO-8A were obtained as described by 
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Eitzen et al. {supra). El 22 is the wild-type strain. 

Figure 4D shows SSE1 complementation of pexl 6-1 mutant for the dimorphic 
transition from yeast to mycelia form. Cells were grown at 30°C in YND liquid medium 
(Eitzen et al., supra). The SSE1 transformant underwent dimorphic transition at a lower 
frequency than the wild-type strain El 22. 

Figure 5 shows the results of competitive RT-PCR analyses of SSE1 
expression profiles. RNA was isolated from flowers before (B), on the day (0), or 1 day 
after pollination (1); from siliques 3 to 21 days after pollination; from cotyledons of 
2-day-old seedlings; and from expanding rosette leaves and roots. An equal amount of 
competitor cDNA template was included in each reaction. The SSE1 target 
(T)-to-competitor (C) cDNA ratios reflect the relative expression levels of the SSE1 gene. 

Abnormal Storage Deposition and the Shrunken Phenotype of ssel Seeds 

In Arabidopsis, proteins and lipids are the major reserves in mature seeds 
(Mansfield and Briarty, Can. J. Bot. 70: 151, 1992; Figs. 1A and IB). To study the 
deposition of these reserves, a shrunken seed 1 (ssel) mutant that alters this seed storage 
profile by accumulating starch over proteins and lipids was isolated (Figs. 1C and ID). 
The cotyledon and the hypocotyl cells of ssel contained no recognizable protein bodies 
and few oil bodies. Starch granules, membrane stacks, vesicles, and vacuoles, all of 
which were absent in wild type cells, were present in ssel cells, and the oil bodies in ssel 
contained higher electron density substances than the wild type. Additionally, the ssel 
seeds were observed to shrink upon desiccation (a likely consequence of insufficient 
deposition of storage molecules), whereas the wild type seeds were desiccation tolerant 
(Figs. IE and IF). 

The ssel mutant was identified in a transferred DNA (T-DNA) transgenic line 
(T line) that exhibited the shrunken seed phenotype as follows. The cDNA of the 
Arabidopsis prohibitin gene Atphbl (Genbank Accession Number: U66591) in an 
antisense orientation was inserted into pBI121 (Clontech, LaJolla, CA) between the SacI 
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and BamHI sites to replaced the P-glucuronidase coding region. This construct was then 
used to transform Arabidopsis thaliana C24 according to standard methods. 
Approximately 2% of the C24 transgenic lines, resulting from transformation 
experiments, showed the shrunken seed phenotype (ssel). In addition, Northern blot 
5 analysis with an Atphbl cDNA bottom strand probe showed that the Atphbl mRNA level 
in ssel was similar to that of the wild type. 

Furthermore, among the T2 seeds on the Tl plant (the primary transgenic 
plant), 90% of the seeds were shrunken and 10% were normally rounded. The shrunken 
seeds were not viable, and plants grown from the round seeds produced -90% shrunken 
10 seeds. Propagation of the transgenic line by self-pollination for 4 generation (to T5) 
showed that this pattern of inheritance continued for generations. However, after 
B backcrossing the T2 line to wild-type plants, ssel was observed to behave as a typical 

single recessive Mendelian gene. This was shown by reciprocally crossing T2 plants 
derived from round seeds to wild-type C24 plants. All Fj seeds resulting from these 
15 crosses were round. The numbers of shrunken/round F 2 seeds in six single siliques were 
1 1/39, 15/36, 12/39, 10/42, 1 1/36, and 10/40; these numbers were consistent with an 
expected segregation ratio of 1:3 (x 2 = 0.64, P > 0.1). 
^ Moreover, when the segregation patterns of F 3 families were observed, ssel 

cosegregated with the T-DNA. In these experiments, two hundred and sixty F 3 seed 
20 families were obtained from individual F2 round seeds, and one hundred and eighty 
families segregated for both the shrunken and kanamycin resistant (conferred by the 
T-DNA) phenotypes, whereas the other 80 showed neither phenotype. 
Cloning of a cDNA Encoding the SSE1 Gene 

The SSE1 gene and its cDNA were cloned as follows. A 136-bp genomic 
25 DNA fragment flanking the T-DNA was isolated by the thermal asymmetric 

interlaced-polymerase chain reaction according to the methods described by Liu et al. 
{Plant J. 8: 457-463, 1995) and used as a probe for screening a genomic library. A 17-kb 
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genomic fragment isolated from a A-FIXII Arabidopsis C24 genomic library was then used 
as a probe to screen an Arabidopsis seedling cDNA library which was prepared according 
to the methods described by Minet et al. (Plant J, 2:417, 1992). Two SSE1 cDNA clones 
were subsequently identified, and DNA sequencing of these clones revealed that both 
5 have identical 5' ends and both included stop codons. The V polyadenylation site was 
determined by 3' rapid amplification of cDNA ends (RACE) polymerase chain reaction 
(PCR) according to standard methods. The SSE1 cDNA sequence (SEQ ID NO:l) and 
its predicted amino acid sequence (SEQ ID NO:2) are shown in Figures 2A and 2B ? 
respectively. SSE1 was found to reside within the BAC clones F17K2 and F4I18 
10 (GenBank Accession Numbers: AC003680 and AC004665, respectively). The SSE1 
protein predicted by the open reading frame was found to differ from the F17K2.22 
hypothetical protein due to discrepancies between the predicted and the actual splicing 
sites. 

Genotype Determination by Single-Seed Polymerase Chain Reaction 
15 The SSE1 sequences obtained were then used to design three primers for 

determining the genotypes of shrunken and round seeds by single seed polymerase chain 
reaction (PCR) (Fig. 3 A). These experiments were performed as follows. DNA was 
isolated from single embryos after removal of the seed coat, which had the same 
genotype as the parent. A single embryo was then transferred to an eppendorf tube 
20 containing 100 ml of grinding buffer, which consisted of 4 parts of homogenization 

buffer (100 mM NaCl, 200 mM sucrose, 100 mM EDTA, and 30 mM Tris-HCl; pH=8.0) 
and 1 part of phage lysis buffer (250 mM EDTA, 50 mM Tris-HCl, and 2.5% SDS; 
pH=9.2), and the embryo was ground with an eppendorf pestle. After incubation at 65°C 
for 20 minutes, 33.3 ml of 3 M potassium acetate (pH 5.2) was added and the reaction 
25 was incubated on ice for 20 minutes. The mixture was centrifuged and the supernatant 
was mixed with 175 ml ethanol to precipitate the DNA. The DNA was dissolved in 20 
ml of water and 1 ml of DNA was used in a 20 ml PCR reaction. As shown in Fig 3 A, 
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primer A (5 ' - ATCAGAGATTGATTT AACGTA-3 ' ; SEQ ID N0:3) and B 
(5 ' -ACGATTTTCAATTATGTGTTC-3 ' ; SEQ ID N0:4) recognized SSE1 gene and 
amplified an ~0.9-kb fragment from the wild-type SSE1 allele. Primer C 
(5 ' -CGCTTGGTCGGTCATTTCG-3 ' ; SEQ ID NO:5) recognized the neomycin 
5 phosphotransferase gene in the T-DNA. Primers B and C therefore amplified an ~1.6-kb 
fragment from the mutant ssel allele. The PCR was performed with an annealing 
temperature of 56°C. 

The results of the single-seed PCR experiments showed that 66% (n=6) of 
round F 2 seeds were heterozygous and 33% were homozygous for the wild-type allele, 
1 0 whereas 1 00% (n=5) of shrunken F 2 seeds were homozygous for the T-DNA insertion 
(Fig. 3B). T4 seeds were also analyzed. All round seeds (n=13) were heterozygous and 
; |: all shrunken seeds (n=21) were homozygous for the T-DNA insertion (Fig. 3C). Thus, 

p ssel is recessive, and in the self-pollinated T line the mutant allele is transmitted at a 

Jif higher frequency than the wild-type allele. For this analysis, Fj seeds were obtained from 

H 15 reciprocal crosses between a round seed-derived T3 plant and a wild-type plant. Their 
iS genotypes were then examined to determine the genotypes of the gametes from the T3 

% plant. The wild-type allele was present in 3 out of 39 eggs and 0 out of 5 1 sperms; the 

O rest of the gametes carried the mutant allele. Therefore, the T4 progeny would be either 

Q homozygous ssel (36/39=92%) or heterozygous (3/39=8%). 

^ 20 Amino Acid Sequence Analysis and SSE1 Complementation of the 

Arabidopsis ssel and 7. lipolytica pex!6 Mutants 

The SSE1 cDNA encodes a predicted protein of 367 amino acids (SEQ ID 
NO:2; Figs. 2B and 4A). Expression of SSE1 cDNA in transgenic ssel plants was found 
to complement the shrunken seed phenotype (Fig. 4B). ssel plants were complemented 
25 with SSE1 as follows. The SSE1 cDNA was fused with the 35S promoter (35SP) and the 
nopaline synthase 3 ' region (NOS 3 *). The 35SP-SSE1-NOS3 ' cassette was subcloned 
into the Kpnl site of the pLVN19R binary vector to make the pLVN19R-SSEl construct. 
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T3 plants from round seeds were then vacuum infiltrated with Agrobacterium 
tumefaciens strain GV3101 (Bechtold et al., C. R. Acad. Sci. Paris Life Sci. 316: 1194, 
1993) carrying pLVN19R-SSEl . Genotypes of seven methotrexate resistant transgenic 
plants were determined by PCR and six were found to be homozygous ssel. Four 
5 transgenic ssel plants were fertile and produced complemented T2 seeds at 67 to 87%. 

Similar to wild type, transgenic ssel seeds expressing SSE1 were tolerant of 
desiccation, and cells were filled with storage proteins and lipids, but lacked starch. As 
shown in Fig. 4A, the SSE1 sequence showed similarity to Pexl6p, a membrane 
associated protein required for the assembly and proliferation of peroxisomes (Eitzen et 
10 al, supra) and for the trafficking of plasma membrane and cell wall associated proteins 
(Titorenko et al., Mol. Cell. Biol. 17:5210, 1997), in the yeast Y. lipolytics Pexl6p is 
fg glycosylated and transiently localized in the endoplasmic reticulum (ER) (Titorenko and 

J Rachubinski, supra). Despite the limited amino acid sequence similarity (26% identity), 

the two proteins have similar arrangements of their hydrophobic and hydrophilic regions 
H 15 (Fig. 4A). A predicted glycosylation site was found in SSE1 (Fig. 4A). SSE1 was also 
\~ found to complement the growth of pex 16 mutants on oleic acid as sole carbon source 

'% (Fig. 4C); indicating restoration of peroxisomal function (Eitzen et al., supra). The 

□ restoration of limited growth of the disruption allele P16KO-8A (Eitzen et al., supra) 

Q indicated that SSE1 cannot fully replace Pexl6p in peroxisome formation, probably due 

20 to the functional difference(s) between the two proteins. In addition, SSE1 partially 

complemented the pex 16-1 mutant for the dimorphic transition from yeast to the mycelia 
form (Fig. 4D). Pexl6p is normally required for mycelia phase specific cell surface 
protein transport. 

Peroxisomes are not generally found in dry seeds (Olsen and Harada, Annu. 
25 Rev. Plant Physiol. Plant Mol. Biol. 46:123, 1995 and references cited therein; Fig. 1). 
Protein and oil bodies are the most abundant organelles in mature Arabidopsis seeds and 
the formation of both is ER-dependent (Mansfield and Briarty, supra; Chrispeels, Annu. 
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Rev. Plant Physiol. Plant Mol. Biol. 42,21, 1991; Sarmiento et al., Plant J. 11:783, 1997; 
and Huang, Plant Physiol. 1 10:1055, 1996). By analogy to the function of Pexl6p's in 
peroxisome assembly and cell surface protein transport, SSE1 could participate in oil 
body formation and storage protein delivery. The vesicles and stacks of membranes in 
5 the ssel cells (Fig. ID) resemble the subcellular structures in the pexl6-l mutant of Y. 
lipolytica (Eitzen et al., supra). The similarities between oil body and peroxisome 
biogenesis are consistent with their related functions in germinating seedlings of 
fat-metabolizing plants, where oil bodies are broken down by glyoxysomes (Olsen and 
Harada, supra). 

10 Competitive RT-PCR Analyses of SSE1 Expression Profiles 

SSE1 gene expression was also analyzed by competitive reverse 
transcription-polymerase chain reaction (RT-PCR). The amount of SSE1 mRNA 
obtained from different tissues and organs was determined as the target-to-competitor 
cDNA ratio as follows. After deoxyribonuclease treatment, 1 mg RNA was reverse 

15 transcribed in a 20 ml reaction, with 0.4 mM of the SSE1 specific primer FP15R 

(5 '-GGCAATATTCTTCCGTTGC-3 ' ; SEQ ID NO:7). Subsequently, 1 ml of the reverse 
transcription mixture and 5 X 10" 21 mol of competitor cDNA were used in each 20 ml 
PCR reaction. The competitor cDNA was identical to the SSE1 cDNA (designated target 
cDNA) except for a 95-bp internal deletion from the EcoRI to the Ncol site. The primers 

20 FP7 (5 ' - AAAAATGGAACT AC ATT ATTCTC-3 ' ; SEQ ID NO:8) and FP14R 

(5 7 - ATAAGTAAAACGCTTAACCTHC-3 ' ; SEQ ID NO:9) amplify 814- and 719-bp 
fragments respectively, from the target and the competitor cDNAs. The ratio of the two 
PCR products reflected the relative amount of SSE1 cDNA (or mRNA) in each sample 
(Siebert and Larrick, Nature 359:557, 1992). The results of these experiments are 

25 depicted in Figure 5 . 

SSE1 steady state mRNA level in the siliques increased during seed 
maturation to a maximum in mature 19- and 21-day-old brown siliques. The level of 
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mRNA was also high in cotyledons of germinating seedlings and flowers, but low in 
expanding leaves and roots. Glyoxysomes are assembled in germinating seedlings (Olsen 
and Harada, supra); therefore SSE1 is likely to be required in this process. The low 
expression in expanding leaves, where leaf peroxisomes are formed, may be due to low 
5 peroxisomes abundance. Alternatively, SSE1 may not normally be involved in 

peroxisome/glyoxysome formation; rather its expression in germinating seedlings may be 
required for maintenance of the remaining oil bodies. The high expression levels in 
flowers suggests additional functions of SSE1, possibly the formation of oil body like 
organelles in tapetum and pollen (Huang, supra). 

1 0 Efficient use of limited amounts of assimilates for seed storage deposition 

requires coordinated metabolic pathways and organelle assembly. In ssel mature 
embryos, cotyledon and hypocotyl cells accumulated excess starch (Fig. 1). The 
functional similarity of SSE1 and Pexl6p argues against SSE1 being a direct inhibitor of 
starch synthesis; rather, it implies that protein and oil body proliferation repress starch 

15 accumulation. Starch accumulation may also be a secondary effect of the lec mutations 
(Meinke et al., Plant Cell 6:1049, 1994). Consistent with the observations in 
Arabidopsis, simultaneous reduction in storage proteins and increase in starch content 
was also observed in a soybean shriveled seed mutant (Chen et al, Am. J. Bot. 85:492, 
1998). Thus, in at least some species of flowering plants, starch accumulation maybe a 

20 default storage deposition pathway during seed development. 
Isolation of Other SSE Genes 

Any cell or tissue can serve as the nucleic acid source for the molecular 
cloning of an SSE gene. Isolation of an SSE gene involves the isolation of those DNA 
sequences which encode a protein exhibiting SSE-associated structures, properties, or 

25 activities, for example, the ability to complement an ssel phenotype. Based on the SSE 
gene and polypeptide described herein, the isolation of additional plant SSE coding 
sequences (e.g., those sequences derived from monocots or dicots) is made possible using 
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standard strategies and techniques that are well known in the art. 

In one particular example, the SSE sequences described herein may be used, 
together with conventional screening methods of nucleic acid hybridization screening. 
Such hybridization techniques and screening procedures are well known to those skilled 
5 in the art and are described, for example, in Benton and Davis, Science 196: 1 80, 1977; 
Grunstein and Hogness, Proc. Natl Acad. Sci., USA 72:3961, 1975; Ausubel et al., 
Current Protocols in Molecular Biology, Wiley Interscience, New York; Berger and 
Kimmel, Guide to Molecular Cloning Techniques, 1987, Academic Press, New York; and 
Sambrook et aL, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

10 Laboratory Press, New York. In one particular example, all or part of the SSE1 cDNA 
(described herein) may be used as a probe to screen a recombinant plant DNA library for 
genes having sequence identity to the SSE gene. Hybridizing sequences are detected by 
plaque or colony hybridization according to the methods described below. 

Alternatively, using all or a portion of the amino acid sequence of the SSE 

15 polypeptide, one may readily design SSE-specific oligonucleotide probes, including SSE 
degenerate oligonucleotide probes (i.e., a mixture of all possible coding sequences for a 
given amino acid sequence). These oligonucleotides may be based upon the sequence of 
either DNA strand and any appropriate portion of the SSE sequence (Fig. 2A; SEQ ID 
NO: 1). General methods for designing and preparing such probes are provided, for 

20 example, in Ausubel et al, (supra), and Berger and Kimmel, (supra). These 

oligonucleotides are useful for SSE gene isolation, either through their use as probes 
capable of hybridizing to SSE complementary sequences or as primers for various 
amplification techniques, for example, polymerase chain reaction (PCR) cloning 
strategies. If desired, a combination of different oligonucleotide probes may be used for 

25 the screening of a recombinant DNA library. The oligonucleotides may be detectably- 
labeled using methods known in the art and used to probe filter replicas from a 
recombinant DNA library. Recombinant DNA libraries are prepared according to 



-23- 



methods well known in the art, for example, as described in Ausubel et al. (supra), or 
they may be obtained from commercial sources. 

In one particular example of this approach, related SSE sequences having 
greater than 80% identity are detected or isolated using high stringency conditions. High 
5 stringency conditions may include hybridization at about 42 °C and about 50% 

formamide, 0.1 mg/mL sheared salmon sperm DNA, 1% SDS, 2X SSC, 10% Dextran 
sulfate, a first wash at about 65 °C, about 2X SSC, and 1% SDS, followed by a second 
wash at about 65 °C and about 0. IX SSC. Alternatively, high stringency conditions may 
include hybridization at about 42 °C and about 50% formamide, 0.1 mg/mL sheared 
10 salmon sperm DNA, 0.5% SDS, 5X SSPE, IX Denhardt's, followed by two washes at 
room temperature and 2X SSC, 0.1% SDS, and two washes at between 55-60 °C and 
1 0.2XSSC,0.1%SDS. 

U In another approach, low stringency hybridization conditions for detecting 

SSE genes having about 30% or greater sequence identity to the SSE genes described 
M 15 herein include, for example, hybridization at about 42 °C and 0.1 mg/mL sheared salmon 
■ ~ sperm DNA, 1% SDS, 2X SSC, and 10% Dextran sulfate (in the absence of formamide), 

^ and a wash at about 37°C and 6X SSC, about 1% SDS. Alternatively, the low stringency 

S hybridization may be carried out at about 42 °C and 40% formamide, 0.1 mg/mL sheared 

Q salmon sperm DNA, 0.5% SDS, 5X SSPE, IX Denhardf s, followed by two washes at 

^ 20 room temperature and 2X SSC, 0.1% SDS and two washes at room temperature and 0.5X 

SSC, 0.1% SDS. These stringency conditions are exemplary; other appropriate 

conditions may be determined by those skilled in the art. 

If desired, competitive RT-PCR or RNA gel blot analysis of total or poly(A+) 

RNAs isolated from any plant (e.g., those crop plants described herein) may be used to 
25 determine the presence or absence of an SSE transcript using conventional methods. 

As discussed above, SSE oligonucleotides may also be used as primers in 

amplification cloning strategies, for example, using PCR. PCR methods are well known 
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in the art and are described, for example, in PCR Technology, Erlich, ed., Stockton Press, 
London, 1989; PCR Protocols: A Guide to Methods and Applications, Innis et al., eds., 
Academic Press, Inc., New York, 1990; and Ausubel et al. {supra). Primers are 
optionally designed to allow cloning of the amplified product into a suitable vector, for 
5 example, by including appropriate restriction sites at the 5' and 3' ends of the amplified 
fragment (as described herein). If desired, SSE sequences may be isolated using the PCR 
"RACE" technique, or Rapid Amplification of cDNA Ends (see, e.g., Innis et al. (supra)). 
By this method, oligonucleotide primers based on an SSE sequence are oriented in the 3' 
and 5' directions and are used to generate overlapping PCR fragments. These 

10 overlapping 3'- and 5'-end RACE products are combined to produce an intact full-length 
cDNA. This method is described in Innis et al (supra); and Frohman et al., Proc. Natl 
Acad. Set USA 85:8998, 1988. 

Alternatively, any plant cDNA or cDNA expression library may be screened 
by functional complementation of an sse mutant (for example, the ssel mutant described 

1 5 herein) according to standard methods described herein. 

Confirmation of a sequence's relatedness to the SSE polypeptide family may 
be accomplished by a variety of conventional methods including, but not limited to, 
functional complementation assays and sequence comparison of the gene and its 
expressed product. In addition, the activity of the gene product may be evaluated 

20 according to any of the techniques described herein, for example, the functional or 
immunological properties of its encoded product. 

Once an SSE sequence is identified, it is cloned according to standard 
methods and used for the construction of plant expression vectors as described below. 
SSE Polypeptide Expression 

25 SSE polypeptides may be expressed and produced by transformation of a 

suitable host cell with all or part of an SSE cDNA (for example, the SSE cDNA (SEQ ID 
NO:l) described above) in a suitable expression vehicle or with a plasmid construct 
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engineered for increasing the expression of an SSE polypeptide {supra) in vivo. 

Those skilled in the field of molecular biology will understand that any of a 
wide variety of expression systems may be used to provide the recombinant protein. The 
precise host cell used is not critical to the invention. The SSE protein may be produced 
in a prokaryotic host, for example, E. coli, or in a eukaryotic host, for example, 
Saccharomyces cerevisiae, mammalian cells (for example, COS 1 or NTH 3T3 cells), or 
any of a number of plant cells or whole plant including, without limitation, algae, tree 
species, ornamental species, temperate fruit species, tropical fruit species, vegetable 
species, legume species, crucifer species, monocots, dicots, or in any plant of commercial 
or agricultural significance. Particular examples of suitable plant hosts include, but are 
not limited to, conifers, petunia, tomato, potato, pepper, tobacco, Arabidopsis, grape, 
lettuce, sunflower, oilseed rape, flax, cotton, sugarbeet, celery, soybean, alfalfa, 
Medicago, lotus, Vigna, cucumber, carrot, eggplant, cauliflower, horseradish, morning 
glory, poplar, walnut, apple, grape, asparagus, cassava, rice, maize, millet, onion, barley, 
orchard grass, oat, rye, and wheat. 

Such cells are available from a wide range of sources including the American 
Type Culture Collection (Rockland, MD); or from any of a number seed companies, for 
example, W. Atlee Burpee Seed Co. (Warminster, PA), Park Seed Co. (Greenwood, SC), 
Johnny Seed Co. (Albion, ME), or Northrup King Seeds (Harstville, SC). Descriptions 
and sources of useful host cells are also found in Vasil I.K., Cell Culture and Somatic 
Cell Genetics of Plants, Vol I, II, III Laboratory Procedures and Their Applications 
Academic Press, New York, 1984; Dixon, R.A., Plant Cell Culture-A Practical 
Approach, IRL Press, Oxford University, 1985; Green et al., Plant Tissue and Cell 
Culture, Academic Press, New York, 1987; and Gasser and Fraley, Science 244:1293, 
1989. 

For prokaryotic expression, DNA encoding an SSE polypeptide is carried on a 
vector operably linked to control signals capable of effecting expression in the 
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prokaryotic host. If desired, the coding sequence may contain, at its 5' end, a sequence 
encoding any of the known signal sequences capable of effecting secretion of the 
expressed protein into the periplasmic space of the host cell, thereby facilitating recovery 
of the protein and subsequent purification. Prokaryotes most frequently used are various 
5 strains of E. coli; however, other microbial strains may also be used. Plasmid vectors are 
used which contain replication origins, selectable markers, and control sequences derived 
from a species compatible with the microbial host. Examples of such vectors are found in 
Pouwels et aL (supra) or Ausubel et al. (supra). Commonly used prokaryotic control 
sequences (also referred to as "regulatory elements") are defined herein to include 

10 promoters for transcription initiation, optionally with an operator, along with ribosome 
binding site sequences. Promoters commonly used to direct protein expression include 
the beta-lactamase (penicillinase), the lactose (lac) (Chang et al., Nature 198:1056, 1977), 
the tryptophan (Trp) (Goeddel et al., Nucl. Acids Res. 8:4057, 1980), and the tac 
promoter systems, as well as the lambda-derived P L promoter and N-gene ribosome 

15 binding site (Simatake et al, Nature 292:128, 1981). 

One particular bacterial expression system for SSE polypeptide production is 
the E. coli pET expression system (Novagen, Inc., Madison, WI). According to this 
expression system, DNA encoding an SSE polypeptide is inserted into a pET vector in an 
orientation designed to allow expression. Since the SSE gene is under the control of the 

20 T7 regulatory signals, expression of SSE is induced by inducing the expression of T7 
RNA polymerase in the host cell. This is typically achieved using host strains which 
express T7 RNA polymerase in response to IPTG induction. Once produced, 
recombinant SSE polypeptide is then isolated according to standard methods known in 
the art, for example, those described herein. 

25 Another bacterial expression system for SSE polypeptide production is the 

pGEX expression system (Pharmacia). This system employs a GST gene fusion system 
which is designed for high-level expression of genes or gene fragments as fusion proteins 
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with rapid purification and recovery of functional gene products. The protein of interest 
is fused to the carboxyl terminus of the glutathione S-transferase protein from 
Schistosoma japonicum and is readily purified from bacterial lysates by affinity 
chromatography using Glutathione Sepharose 4B. Fusion proteins can be recovered 
5 under mild conditions by elution with glutathione. Cleavage of the glutathione S- 
transferase domain from the fusion protein is facilitated by the presence of recognition 
sites for site-specific proteases upstream of this domain. For example, proteins expressed 
in pGEX-2T plasmids may be cleaved with thrombin; those expressed in pGEX-3X may 
be cleaved with factor Xa. 
10 For eukaryotic expression, the method of transformation or transfection and 

the choice of vehicle for expression of the SSE polypeptide will depend on the host 
.|j system selected. Transformation and transfection methods are described, e.g., in Ausubel 

% et al. (supra); Weissbach and Weissbach, Methods for Plant Molecular Biology, 

Academic Press, 1989; Gelvin et al., Plant Molecular Biology Manual, Kluwer Academic 

L-J 

y 15 Publishers, 1990; Kindle, K., Proc. Natl. Acad. Sci, U.S.A. 87:1228, 1990; Potrykus, L, 
l y Annu. Rev. Plant Physiol. Plant Mol. Biology 42:205, 1991 ; and BioRad (Hercules, CA) 

^ Technical Bulletin #1687 (Biolistic Particle Delivery Systems). Expression vehicles may 

□ be chosen from those provided, e.g., in Cloning Vectors: A Laboratory Manual (P.H. 

q Pouwels et al., 1985, Supp. 1987); Gasser and Fraley (supra); Clontech Molecular 

u 20 Biology Catalog (Catalog 1992/93 Tools for the Molecular Biologist, Palo Alto, CA); and 

the references cited above. Other expression constructs are described by Fraley et al 

(U.S. Pat. No. 5,352,605). 

Construction of Plant Transgenes 

Most preferably, an SSE polypeptide is produced by a stably-transfected plant 
25 cell line, a transiently-transfected plant cell line, or by a transgenic plant. A number of 

vectors suitable for stable or extrachromosomal transfection of plant cells or for the 

establishment of transgenic plants are available to the public; such vectors are described 
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in Pouwels et al. {supra), Weissbach and Weissbach (supra), and Gelvin et al. (supra). 
Methods for constructing such cell lines are described in, e.g., Weissbach and Weissbach 
(supra), and Gelvin et al. (supra). 

Typically, plant expression vectors include (1) a cloned plant gene under the 
5 transcriptional control of 5 ' and 3 ' regulatory sequences and (2) a dominant selectable 
marker. Such plant expression vectors may also contain, if desired, a promoter regulatory 
region (for example, one conferring inducible or constitutive, pathogen- or wound- 
induced, environmentally- or developmentally-regulated, or cell- or tissue-specific 
expression), a transcription initiation start site, a ribosome binding site, an RNA 
10 processing signal, a transcription termination site, and/or a polyadenylation signal. 

Once the desired SSE nucleic acid sequence is obtained as described above, it 
in may be manipulated in a variety of ways known in the art. For example, where the 

JI sequence involves non-coding flanking regions, the flanking regions may be subjected to 

] j} mutagenesis. 

H 15 The SSE DNA sequence of the invention may, if desired, be combined with 

! 3 " other DNA sequences in a variety of ways. The SSE DNA sequence of the invention 

g may be employed with all or part of the gene sequences normally associated with the SSE 

Q protein. In its component parts, a DNA sequence encoding an SSE protein is combined 

Q in a DNA construct having a transcription initiation control region capable of promoting 

JM# 20 transcription and translation in a host cell. 

In general, the constructs will involve regulatory regions functional in plants 
which provide for modified production of SSE protein as discussed herein. The open 
reading frame coding for the SSE protein or functional fragment thereof will be joined at 
its 5' end to a transcription initiation regulatory region such as the sequence naturally 
25 found in the 5' upstream region of the SSE structural gene. Numerous other transcription 
initiation regions are available which provide for constitutive or inducible regulation. 
For applications where developmental, cell, tissue, hormonal, or 
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environmental expression is desired, appropriate 5' upstream non-coding regions are 
obtained from other genes, for example, from genes regulated during meristem 
development, seed development, embryo development, or leaf development. 

Regulatory transcript termination regions may also be provided in DNA 
5 constructs of this invention as well. Transcript termination regions may be provided by 
the DNA sequence encoding the SSE protein or any convenient transcription termination 
region derived from a different gene source. The transcript termination region will 
contain preferably at least 1-3 kb of sequence 3 ? to the structural gene from which the 
termination region is derived. Plant expression constructs having SSE as the DNA 
10 sequence of interest for expression (in either the sense or antisense orientation) may be 
employed with a wide variety of plant life, particularly plant life involved in the 
; |i production of storage reserves (for example, those involving carbon and nitrogen 

|f metabolism). Such genetically-engineered plants are useful for a variety of industrial and 

tfl agricultural applications as discussed infra. Importantly, this invention is applicable to 

%| 15 dicotyledons and monocotyledons, and will be readily applicable to any new or improved 
j s w transformation or regeneration method. 

^ The expression constructs include at least one promoter operably linked to at 

Q least one SSE gene. An example of a useful plant promoter according to the invention is 

^ a caulimovirus promoter, for example, a cauliflower mosaic virus (CaMV) promoter. 

w 20 These promoters confer high levels of expression in most plant tissues, and the activity of 
these promoters is not dependent on virally encoded proteins. CaMV is a source for both 
the 35S and 19S promoters. Examples of plant expression constructs using these 
promoters are found in Fraley et al., U.S. Pat. No. 5,352,605. In most tissues of 
transgenic plants, the CaMV 35S promoter is a strong promoter (see, e.g., Odell et al, 
25 Nature 313:810, 1985). The CaMV promoter is also highly active in monocots (see, e.g., 
Dekeyser et al., Plant Cell 2:591, 1990; Terada and Shimamoto, Mol. Gen. Genet. 
220:389, 1990). Moreover, activity of this promoter can be further increased (i.e., 
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between 2-10 fold) by duplication of the CaMV 35S promoter (see e.g., Kay et al, 
Science 236:1299, 1987; Ow et al., Proc. Natl Acad. Sci., U.S.A. 84:4870, 1987; and 
Fang et al, Plant Cell 1:141, 1989, and McPherson and Kay, U.S. Pat. No. 5,378,142). 
Other useful plant promoters include, without limitation, the nopaline 
5 synthase (NOS) promoter (An et al, Plant Physiol. 88:547, 1988 and Rodgers and Fraley, 
U.S. Pat. No. 5,034,322), the octopine synthase promoter (Fromm et al., Plant Cell 1:977, 
1989), figwort mosiac virus (FMV) promoter (Rogers, U.S. Pat. No. 5,378,619), and the 
rice actin promoter (Wu and McElroy, W09 1/09948). 

Exemplary monocot promoters include, without limitation, commelina yellow 
10 mottle virus promoter, sugar cane badna virus promoter, rice tungro bacilliform virus 
promoter, maize streak virus element, and wheat dwarf virus promoter. 
i|| For certain applications, it may be desirable to produce the SSE gene product 

I?: in an appropriate tissue, at an appropriate level, or at an appropriate developmental time. 

j£J For this purpose, there are an assortment of gene promoters, each with its own distinct 

\f 15 characteristics embodied in its regulatory sequences, shown to be regulated in response to 
r ~ inducible signals such as the environment, hormones, and/or developmental cues. These 

} % include, without limitation, gene promoters that are responsible for heat-regulated gene 

□ expression (see, e.g., Callis et al., Plant Physiol. 88:965, 1988; Takahashi and Komeda, 

q Mol. Gen. Genet. 219:365, 1989; and Takahashi et al. Plant J. 2:751, 1992), light- 

20 regulated gene expression (e.g., the pea rbcS-3A described by Kuhlemeier et al, Plant 
Cell 1 :471, 1989; the maize rbcS promoter described by Schaffher and Sheen, Plant Cell 
3:997, 1991; the chlorophyll a/b-binding protein gene found in pea described by Simpson 
et al., EMBO J. 4:2723, 1985; the Arabssu promoter; or the rice rbs promoter), hormone- 
regulated gene expression (for example, the abscisic acid (ABA) responsive sequences 
25 from the Em gene of wheat described by Marcotte et al, Plant Cell 1 :969, 1989; the 
ABA-inducible HVA1 and HVA22, and rd29A promoters described for barley and 
Arabidopsis by Straub et al, Plant Cell 6:617, 1994 and Shen et al., Plant Cell 7:295, 
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1995; and wound-induced gene expression (for example, of wunl described by Siebertz 
et al., Plant Cell 1:961, 1989), organ-specific gene expression (for example, of the tuber- 
specific storage protein gene described by Roshal et al, EMBO J. 6:1155, 1987; the 23- 
kDa zein gene from maize described by Schernthaner et al., EMBO J. 7:1249, 1988; or 
the French bean 8-phaseolin gene described by Bustos et al., Plant Cell 1:839, 1989), or 
pathogen-inducible promoters (for example, PR-1, prp-1, or p-1,3 glucanase promoters, the 
fiingal-inducible wirla promoter of wheat, and the nematode-inducible promoters, 
TobRB7-5A and Hmg-1, of tobacco and parsley, respectively). 

Plant expression vectors may also optionally include RNA processing signals, 
e.g, introns, which have been shown to be important for efficient RNA synthesis and 
accumulation (Callisetal, Genes and Dev. 1:1183, 1987). The location of the RNA 
splice sequences can dramatically influence the level of transgene expression in plants. 
In view of this fact, an intron may be positioned upstream or downstream of an SSE 
polypeptide-encoding sequence in the transgene to modulate levels of gene expression. 

In addition to the aforementioned 5' regulatory control sequences, the 
expression vectors may also include regulatory control regions which are generally 
present in the 3' regions of plant genes (Thornburg et al, Proc. Natl. Acad. Sci. U.S.A. 
84:744, 1987; An et al., Plant Cell 1:115, 1989). For example, the 3' terminator region 
may be included in the expression vector to increase stability of the mRNA. One such 
terminator region may be derived from the PI-II terminator region of potato. In addition, 
other commonly used terminators are derived from the octopine or nopaline synthase 
signals. 

The plant expression vector also typically contains a dominant selectable 
marker gene used to identify those cells that have become transformed. Useful selectable 
genes for plant systems include genes encoding antibiotic resistance genes, for example, 
those encoding resistance to hygromycin, kanamycin, bleomycin, G418, streptomycin, or 
spectinomycin. Genes required for photosynthesis may also be used as selectable 
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markers in photosynthetic-deficient strains. Finally, genes encoding herbicide resistance 
may be used as selectable markers; useful herbicide resistance genes include the bar gene 
encoding the enzyme phosphinothricin acetyltransferase and conferring resistance to the 
broad spectrum herbicide Basta® (Hoechst AG, Frankfurt, Germany). 
5 Efficient use of selectable markers is facilitated by a determination of the 

susceptibility of a plant cell to a particular selectable agent and a determination of the 
concentration of this agent which effectively kills most, if not all, of the transformed 
cells. Some useful concentrations of antibiotics for tobacco transformation include, e.g., 
75-100 |ag/mL (kanamycin), 20-50 (ig/mL (hygromycin), or 5-10 (ig/mL (bleomycin). A 
10 useful strategy for selection of transformants for herbicide resistance is described, e.g., by 
Vasil et al., supra. 

H In addition, if desired, the plant expression construct may contain a modified 

]* or fully-synthetic structural SSE coding sequence which has been changed to enhance the 

i! performance of the gene in plants. Methods for constructing such a modified or synthetic 

H 15 gene are described in Fischoff and Perlak, U.S. Pat. No. 5,500,365. 
3 It will also be readily appreciated by those skilled in the art that a wide variety 

? of methods are known for engineering transgene constructs which silence or inactivate an 

*f endogenous gene. Homologous recombination is but one of the methods known to those 

3 skilled in the art for rendering an endogenous gene inoperative. Thus, when the 

20 engineered gene is homologously recombined into the plant, the endogenous gene will be 
rendered inoperative. An overview of this general process is provided in Yoder et al. 
("Progress Towards Gene Targeting in Plants," Genetic Engineering, Vol. 13 , Plenum 
Press, New York, 1991). Accordingly, gene targeting can be used to silence or replace 
the endogenous gene with an engineered allele; thus the phenotype of the altered gene, or 
25 its regulatory sequences, can be evaluated in planta. In addition, methods for 

constructing transgene constructs for silencing or inactivating gene expression in plants 
using antisense or co-suppression technologies are well known in the art. 
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It should be readily apparent to one skilled in the art of molecular biology, 
especially in the field of plant molecular biology, that the level of gene expression of a 
transgene construct is dependent, not only on the combination of promoters, RNA 
processing signals, and terminator elements, but also on how these elements are used to 
5 increase the levels of selectable marker gene expression. 
Plant Transformation 

Upon construction of the plant expression vector, several standard methods 
are available for introduction of the vector into a plant host, thereby generating a 
transgenic plant. These methods include (1) Agrobacterium-mediated transformation (A. 
10 tumefaciens or A rhizogenes) (see, e.g., Lichtenstein and Fuller In: Genetic Engineering, 
vol 6, PWJ Rigby, ed, London, Academic Press, 1987; and Lichtenstein, CP., and 
3 Draper, J,. In: DNA Cloning, Vol II, D.M. Glover, ed, Oxford, IRI Press, 1 985)), (2) the 

J particle delivery system (see, e.g., Gordon-Kamm et al., Plant Cell 2:603 (1 990); or 

;J1 BioRad Technical Bulletin 1687, supra), (3) microinjection protocols (see, e.g., Green et 

\| 15 al., supra), (4) polyethylene glycol (PEG) procedures (see, e.g., Draper et al., Plant Cell 
Physiol. 23:451, 1982; or e.g., Zhang and Wu, Theor. Appl. Genet. 76:835, 1988), (5) 
liposome-mediated DNA uptake (see, e.g., Freeman et al., Plant Cell Physiol. 25:1353, 
Q 1984), (6) electroporation protocols (see, e.g., Gelvin et al, supra; Dekeyser et al, supra; 

J Fromm et al, Nature 319:791, 1986; Sheen Plant Cell 2:1027, 1990; or Jang and Sheen 

w 20 Plant Cell 6:1665, 1994), and (7) the vortexing method (see, e.g., Kindle supra). The 

method of transformation is not critical to the invention. Any method which provides for 
efficient transformation may be employed. As newer methods are available to transform 
crops or other host cells, they may be directly applied. Suitable plants for use in the 
practice of the invention include, but are not limited to, sugar cane, wheat, rice, maize, 
25 sugar beet, potato, barley, manioc, sweet potato, soybean, sorghum, cassava, banana, 
grape, oats, tomato, millet, coconut, orange, rye, cabbage, apple, watermelon, canola, 
cotton, carrot, garlic, onion, pepper, strawberry, yam, peanut, onion, bean, pea, mango, 
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citrus plants, walnuts, and sunflower. 

The following is an example outlining one particular technique, an 
Agrobacterium-mediated plant transformation. By this technique, the general process for 
manipulating genes to be transferred into the genome of plant cells is carried out in two 
5 phases. First, cloning and DNA modification steps are carried out in E. coli, and the 
plasmid containing the gene construct of interest is transferred by conjugation or 
electroporation into Agrobacterium. Second, the resulting Agrobacterium strain is used 
to transform plant cells. Thus, for the generalized plant expression vector, the plasmid 
contains an origin of replication that allows it to replicate in Agrobacterium and a high 
10 copy number origin of replication functional in E. colt This permits facile production 
and testing of transgenes in E. coli prior to transfer to Agrobacterium for subsequent 
:g introduction into plants. Resistance genes can be carried on the vector, one for selection 

*Zz in bacteria, for example, streptomycin, and another that will function in plants, for 

y I example, a gene encoding kanamycin resistance or herbicide resistance. Also present on 

Vs 15 the vector are restriction endonuclease sites for the addition of one or more transgenes 
|* and directional T-DNA border sequences which, when recognized by the transfer 

functions of Agrobacterium, delimit the DNA region that will be transferred to the plant. 
□ In another example, plant cells may be transformed by shooting into the cell 

H tungsten microprojectiles on which cloned DNA is precipitated. In the Biolistic 

^ 20 Apparatus (Bio-Rad) used for the shooting, a gunpowder charge (22 caliber Power Piston 
Tool Charge) or an air-driven blast drives a plastic macroprojectile through a gun barrel. 
An aliquot of a suspension of tungsten particles on which DNA has been precipitated is 
placed on the front of the plastic macroprojectile. The latter is fired at an acrylic stopping 
plate that has a hole through it that is too small for the macroprojectile to pass through. 
25 As a result, the plastic macroprojectile smashes against the stopping plate, and the 

tungsten microprojectiles continue toward their target through the hole in the plate. For 
the instant invention the target can be any plant cell, tissue, seed, or embryo. The DNA 
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introduced into the cell on the microprojectiles becomes integrated into either the nucleus 
or the chloroplast 

In general, transfer and expression of transgenes in plant cells are now routine 
practices to those skilled in the art, and have become major tools to carry out gene 
5 expression studies in plants and to produce improved plant varieties of agricultural or 
commercial interest. 
Transgenic Plant Regeneration 

Plant cells transformed with a plant expression vector can be regenerated, for 
example, from single cells, callus tissue, or leaf discs according to standard plant tissue 

10 culture techniques. It is well known in the art that various cells, tissues, and organs from 
almost any plant can be successfully cultured to regenerate an entire plant; such 
techniques are described, e.g., in Vasil supra; Green et al., supra; Weissbach and 
Weissbach, supra; and Gelvin et al., supra. 

In one particular example, a cloned SSE polypeptide construct under the 

15 control of the 35S CaMV promoter and the nopaline synthase terminator and carrying a 
selectable marker (for example, kanamycin resistance) is transformed into 
Agrobacterium. Transformation of leaf discs (for example, of tobacco or potato leaf 
discs), with vector-containing Agrobacterium is carried out as described by Horsch et al. 
{Science 227:1229, 1985). Putative transformants are selected after a few weeks (for 

20 example, 3 to 5 weeks) on plant tissue culture media containing kanamycin (e.g. 100 
Hg/mL). Kanamycin-resistant shoots are then placed on plant tissue culture media 
without hormones for root initiation. Kanamycin-resistant plants are then selected for 
greenhouse growth. If desired, seeds from self-fertilized transgenic plants can then be 
sowed in a soil-less medium and grown in a greenhouse. Kanamycin-resistant progeny 

25 are selected by sowing surfaced sterilized seeds on hormone-free kanamycin-containing 
media. Analysis for the integration of the transgene is accomplished by standard 
techniques (see, for example, Ausubel et al supra; Gelvin et al. supra). 
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Transgenic plants expressing the selectable marker are then screened for 
transmission of the transgene DNA by standard immunoblot and DNA detection 
techniques. Each positive transgenic plant and its transgenic progeny are unique in 
comparison to other transgenic plants established with the same transgene. Integration of 
5 the transgene DNA into the plant genomic DNA is in most cases random, and the site of 
integration can profoundly affect the levels and the tissue and developmental patterns of 
transgene expression. Consequently, a number of transgenic lines are usually screened 
for each transgene to identify and select plants with the most appropriate expression 
profiles. 

10 Transgenic lines are evaluated for levels of transgene expression. Expression 

at the RNA level is determined initially to identify and quantitate expression-positive 
; ; fl plants. Standard techniques for RNA analysis are employed and include PCR 

p amplification assays using oligonucleotide primers designed to amplify only transgene 

;rj RNA templates and solution hybridization assays using transgene-specific probes (see, 

\J 15 e.g., Ausubel et al. 5 supra). The RNA-positive plants are then analyzed for protein 
K ~ expression by Western immunoblot analysis using SSE specific antibodies (see, e.g., 

i; p Ausubel et al., supra). In addition, in situ hybridization and immunocytochemistry 

Q according to standard protocols can be done using transgene-specific nucleotide probes 

p and antibodies, respectively, to localize sites of expression within transgenic tissue. 

20 In addition, if desired, once the recombinant SSE protein is expressed in any 

cell or in a transgenic plant (for example, as described above), it may be isolated, e.g., 
using affinity chromatography. In one example, an anti-SSE polypeptide antibody (e.g., 
produced as described in Ausubel et al., supra, or by any standard technique) may be 
attached to a column and used to isolate the polypeptide. Lysis and fractionation of SSE- 
25 producing cells prior to affinity chromatography may be performed by standard methods 
(see, e.g., Ausubel et al., supra). Once isolated, the recombinant protein can, if desired, 
be further purified, for example, by high performance liquid chromatography (see, e.g., 
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Fisher, Laboratory Techniques In Biochemistry And Molecular Biology, eds., Work and 
Burdon, Elsevier, 1980). 

Polypeptides of the invention, particularly short SSE protein fragments, can 
also be produced by chemical synthesis (e.g., by the methods described in Solid Phase 
5 Peptide Synthesis, 2nd ed., 1984 The Pierce Chemical Co., Rockford, IL). These general 
techniques of polypeptide expression and purification can also be used to produce and 
isolate useful SSE fragments or analogs. 
Engineering Storage Reserve Materials 

As discussed above, plasmid constructs designed for the expression of SSE 
10 gene products are useful, for example, for modifying or altering seed or vegetative 

storage reserve profiles. SSE genes that are isolated from a host plant (e.g., Arabidopsis 
J or Brassica) may be engineered for increased or decreased expression in the same plant, a 

1" closely related species, or a distantly related plant species. For example, the cruciferous 

ill Arabidopsis SSE1 gene may be engineered for constitutive expression and then 

xj 15 transformed into an Arabidopsis host plant. Alternatively, the Arabidopsis SSE1 gene 

m i 

; * may be engineered for expression other cruciferous plants, such as the Brassicas (for 

J example, broccoli, cabbage, and cauliflower). Evaluation of the modification conferred 

Q on a plant by ectopic expression of an SSE gene is determined according to conventional 

q methods and assays (for example, those described herein). 

w 20 In one working example, constitutive expression of the SSE1 gene of 

Arabidopsis (Fig. 2A; SEQ ID NO:l) is used to alter seed storage reserve deposition in 
transgenic seeds of Brassica. In one particular example, a plant expression vector is 
constructed that contains an SSE1 cDNA sequence expressed under the control of the 
enhanced CaMV 35S promoter as described by McPherson and Kay (U.S. Patent No. 
25 5,359,142). This expression vector is then used to transform Brassica according to the 
methods described in Moloney et al. (U.S. Patent No. 5,750,827). Seeds of transformed 
Brassica and control plants are then profiled for storage reserve material according to 
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conventional methods to determine the qualitative and quantitative aspects of the 
deposited reserve materials. Transformed plants that express an SSE1 gene and produce 
seeds having an increased level of storage reserve material (e.g., reserve lipid or storage 
protein) relative to control plants are taken as being useful in the invention. 

In another working example, gene silencing or inactivation technologies may 
also be used to modify or alter the deposition of seed storage reserve material. 
Exemplary methods for silencing or inactivating plant gene expression include antisense 
RNA (Shewmaker et al, U.S. Patent 5,107,065), co-suppression (Napoli et al, U.S. Patent 
5,034,327), and homologous recombination (Offringa et al., United States Patent 
5,501,967). For example, a plant expression vector is constructed that contains an 
antisense SSE1 which is expressed under the control of the enhanced CaMV 35S 
promoter as described by McPherson and Kay, supra and Shewmaker at al. (U.S. Patent 
No. 5,107,065). This expression vector is then used to transform Brassica according to 
the methods described in Moloney et al, supra. To assess reserve material deposition, 
transformed plants and appropriate controls are grown, and the storage reserves of their 
seeds are evaluated according to standard methods, for example, those described herein. 
Transformed Brassica plants that express an antisense SSE1 sequence and that produce 
seeds having a decreased level of reserve material relative to control plants are taken as 
being useful in the invention. 
Engineering Stress-Protected Plants 

As discussed above, because constitutive expression of an SSE1 has been 
found to confer dessication tolerance, constructs designed for the expression of an SSE 
polypeptide (or an SSE 1 homolog) are useful for generating transgenic seeds having an 
increased level of tolerance to environmental stress. To achieve such tolerance, it is 
important to express such a protein at an effective level in a transgenic seed. Seed- 
specific gene promoters are especially useful for this purpose. Evaluation of the level of 
stress protection conferred to a seed by expression of a DNA sequence expressing an 
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SSE1 polypeptide is determined according to conventional methods and assays as 
described below. 

In one working example, seed-specific expression of an SSE gene, for 
example, the SSE1, is used in Brassica to enhance salt stress tolerance. For example, a 
plant expression vector is constructed that contains an SSE1 sequence expressed under 
the control of a Brassica seed-specific promoter. This expression vector is then used to 
transform Brassica according to standard methods. To assess salt tolerance, seeds 
obtained from transformed Brassica plants and appropriate controls are evaluated 
according to standard methods. Transgenic seeds containing the gene are germinated in 
the presence of various salt or osmotically active solutions to determine whether 
transgenic seeds demonstrate increased tolerance or resistance to salt stress. If desired, 
seedlings can also be grown in hydroponic systems and challenged with salt or agents of 
differing osmotic potentials at different, or all, developmental stages in order to assess the 
response of SSE1 -expressing plants to these stresses. Growth and physiological 
measurements are used to document the differences. Transformed Brassica plants which 
produce seeds having an increased level of salt tolerance relative to control plants are 
taken as being useful in the invention. 
Engineering Plants Having Increased Yield/Productivity 

Seeds of transgenic plants expressing a recombinant SSE gene (or an SSE1 
homolog) are planted out in test plots, and their agronomic performance is compared to 
standard plants using techniques familiar to those of skill in the art. Optionally included 
in this comparison are plants of similar genetic background without the transgene. A 
yield benefit is observed and plants exhibiting the increased yield are advanced for 
commercialization. 

In addition, transgenic plants expressing an SSE gene (or an SSE1 homolog) 
are field tested for agronomic performance under conditions, including, but not limited to, 
limited or inadequate water availability. When compared to nontransgenic plants, 
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transgenic plants expressing the SSE1 gene exhibit higher yield than their non-transgenic 
counterparts under non-optimal growing conditions. 
SSE Interacting Polypeptides 

The isolation of SSE sequences also facilitates the identification of 
5 polypeptides which interact with the SSE protein. Such polypeptide-encoding sequences 
are isolated by any standard two hybrid system (see, for example, Fields et aL, Nature 
340:245-246, 1989; Yang et aL, Science 257:680-682, 1992; Zervos et aL, Cell 72:223- 
232, 1993). For example, all or a part of the SSE sequence may be fused to a DNA 
binding domain (such as the GAL4 or LexA DNA binding domain). After establishing 

10 that this fusion protein does not itself activate expression of a reporter gene (for example, 
a lacZ or LEU2 reporter gene) bearing appropriate DNA binding sites, this fusion protein 
is used as an interaction target. Candidate interacting proteins fused to an activation 
domain (for example, an acidic activation domain) are then co-expressed with the SSE 
fusion in host cells, and interacting proteins are identified by their ability to contact the 

15 SSE sequence and stimulate reporter gene expression. SSE-interacting proteins identified 
using this screening method provide good candidates for proteins that are involved in the 
acquired resistance signal transduction pathway. 
Antibodies 

SSE polypeptides described herein (or imunogenic fragments or analogs) may 
20 be used to raise antibodies useful in the invention; such polypeptides may be produced by 
recombinant or peptide synthetic techniques (see, e.g., Solid Phase Peptide Synthesis, 2nd 
ed., 1984, Pierce Chemical Co., Rockford, IL; Ausubel et aL, supra). The peptides may 
be coupled to a carrier protein, such as KLH as described in Ausubel et al, supra. The 
KLH-peptide is mixed with Freund's adjuvant and injected into guinea pigs, rats, or 
25 preferably rabbits. Antibodies may be purified by peptide antigen affinity 
chromatography. 

Monoclonal antibodies may be prepared using the SSE polypeptides described 



-41 - 



above and standard hybridoma technology (see, e.g., Kohler et al., Nature 256:495, 1975; 
Kohler et al., Eur. J. Immunol. 6:511, 1976; Kohler et al., Eur. J. Immunol. 6:292, 1976; 
Hammerling et al., In Monoclonal Antibodies and T Cell Hybridomas, Elsevier, NY, 
1981; Ausubel et al, supra), 
5 Once produced, polyclonal or monoclonal antibodies are tested for specific 

SSE recognition by Western blot or immunoprecipitation analysis (by the methods 
described in Ausubel et al., supra). Antibodies which specifically recognize SSE 
polypeptides are considered to be useful in the invention; such antibodies may be used, 
e.g., in an immunoassay to monitor the level of SSE polypeptide produced by a plant. 

10 

Other Embodiments 
The invention further includes analogs of any naturally-occurring plant SSE 
polypeptide. Analogs can differ from the naturally-occurring SSE protein by amino acid 
sequence differences, by post-translational modifications, or by both. Analogs of the 

15 invention will generally exhibit at least 40%, more preferably 50%, and most preferably 
60% or even having 70%, 80%, or 90% identity with all or part of a naturally-occurring 
plant SSE amino acid sequence. The length of sequence comparison is at least 15 amino 
acid residues, preferably at least 25 amino acid residues, and more preferably more than 
35 amino acid residues. Modifications include in vivo and in vitro chemical 

20 derivatization of polypeptides, e.g., acetylation, carboxylation, phosphorylation, or 

glycosylation; such modifications may occur during polypeptide synthesis or processing 
or following treatment with isolated modifying enzymes. Analogs can also differ from 
the naturally-occurring SSE polypeptide by alterations in primary sequence. These 
include genetic variants, both natural and induced (for example, resulting from random 

25 mutagenesis by irradiation or exposure to ethyl methylsulfate or by site-specific 

mutagenesis as described in Sambrook, Fritsch and Maniatis, supra, or Ausubel et al., 
supra). Also included are cyclized peptides, molecules, and analogs which contain 
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residues other than L-amino acids, e.g., D-amino acids or non-naturally occurring or 
synthetic amino acids, e.g., p or y amino acids. 

In addition to full-length polypeptides, the invention also includes SSE 
polypeptide fragments. As used herein, the term "fragment," means at least 20 
5 contiguous amino acids, preferably at least 30 contiguous amino acids, more preferably at 
least 50 contiguous amino acids, and most preferably at least 60 to 80 or more contiguous 
amino acids. Fragments of SSE polypeptides can be generated by methods known to 
those skilled in the art or may result from normal protein processing (e.g., removal of 
amino acids from the nascent polypeptide that are not required for biological activity or 
10 removal of amino acids by alternative mRNA splicing or alternative protein processing 
events). In preferred embodiments, an SSE polypeptide fragment includes an ankyrin- 
repeat motif as described herein. In other preferred embodiments, an SSE fragment is 
^ capable of interacting with a second polypeptide component of the SSE signal 

;«55s 

1 11 transduction cascade. 

%| 15 Furthermore, the invention includes nucleotide sequences that facilitate 

■ specific detection of an SSE nucleic acid. Thus, SSE sequences described herein or 
y portions thereof may be used as probes to hybridize to nucleotide sequences from other 

p plants (e.g., dicots, monocots, gymnosperms, and algae) by standard hybridization 

q techniques under conventional conditions. Sequences that hybridize to an SSE coding 

W 20 sequence or its complement and that encode an SSE polypeptide are considered useful in 
the invention. As used herein, the term "fragment," as applied to nucleic acid sequences, 
means at least 5 contiguous nucleotides, preferably at least 10 contiguous nucleotides, 
more preferably at least 20 to 30 contiguous nucleotides, and most preferably at least 40 
to 80 or more contiguous nucleotides. Fragments of SSE nucleic acid sequences can be 
25 generated by methods known to those skilled in the art. 

All publications and patent applications mentioned in this specification are 
herein incorporated by reference to the same extent as if each independent publication or 
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patent application was specifically and individually indicated to be incorporated by 
reference. 



What is claimed is: 
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Claims 



1 . An isolated nucleic acid molecule comprising a sequence encoding an 
SSE polypeptide. 

2. The nucleic acid molecule of claim 1, wherein said sequence encodes an 
SSE polypeptide having at least 30% identity with the amino acid sequence shown in 
Fig. 2B (SEQ ID NO:2). 

3. The nucleic acid molecule of claim 1, wherein said sequence encodes an 
SSE polypeptide that, when expressed in a cell of a plant, modifies the production of food 
storage reserves. 

4. The nucleic acid molecule of claim 1, wherein said sequence encodes an 
SSE polypeptide that, when expressed in a cell of a plant, facilitates the intracellular 
transport of a storage protein. 

5. The nucleic acid molecule of claim 1, wherein said sequence encodes an 
SSE polypeptide that, when expressed in a cell of a plant, facilitates the formation of 
protein bodies. 

6. The nucleic acid molecule of claim 1, wherein said sequence encodes an 
SSE polypeptide that, when expressed in a cell of a plant, facilitates the formation of oil 
bodies. 
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7. The nucleic acid molecule of claim 1, wherein said nucleic acid molecule 

is cDNA. 



8. An isolated nucleic acid molecule comprising a sequence encoding an 
SSE polypeptide, wherein said isolated nucleic acid molecule hybridizes specifically to 

5 the nucleic acid molecule comprising the cDNA of Fig. 2A (SEQ ID NO: 1). 

9. The nucleic acid molecule of claim 1, wherein said sequence encodes an 
SSE polypeptide having at least 30% identity with the amino acid sequence shown in Fig. 
2B (SEQ ID NO:2). 

10 10. The isolated nucleic acid molecule of claim 1 or 8, wherein said nucleic 

acid molecule is operably linked to a promoter functional in a plant cell. 



11. An expression vector comprising the nucleic acid molecule of claim 1 
or 8, said vector being capable of directing expression of the polypeptide encoded by said 
nucleic acid molecule. 



15 



12. 



A cell comprising the isolated nucleic acid molecule of claim 1 or 8. 



13. 



The cell of claim 12, wherein said cell is a plant cell. 



14. 



The cell of claim 12, wherein said cell is a bacterial cell. 



15. 



The cell of claim 12, wherein said bacterial cell is Agrobacterium. 



16. 



A transgenic plant or transgenic plant component comprising a nucleic 
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acid molecule of claim 1 or 8, wherein said nucleic acid molecule is expressed in said 
transgenic plant or said transgenic plant component. 



17. The plant or plant component of claim 16, wherein said transgenic plant 
or transgenic plant component is an angiosperm. 

5 18. The plant or plant component of claim 1 6, wherein said transgenic plant 

or transgenic plant component is a dicot. 

1 9. The plant or plant component of claim 1 6, wherein said transgenic plant 
or transgenic plant component is a cruciferous plant. 

20. The plant or plant component of claim 1 6, wherein said transgenic plant 
or transgenic plant component is a monocot. 

21 . A seed from a transgenic plant or transgenic plant component of claim 

16. 

22. A cell from a transgenic plant or transgenic plant component of claim 

16. 

15 23 . An expression vector for producing antisense SSE RNA. 

24. A transgenic plant or transgenic plant component comprising the vector 
of claim 23. 

25. A seed from a transgenic plant or transgenic plant component of claim 
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24. 



26. A cell from a transgenic plant or transgenic plant component of claim 

24. 

27. A substantially pure SSE polypeptide comprising an amino acid 

5 sequence having at least 30% identity to the amino acid sequence of Fig. 2B (SEQ ID 
NO:2). 

28. The polypeptide of claim 27, wherein said polypeptide modifies the 
production of food storage reserves. 

t! 29. The polypeptide of claim 27, wherein said polypeptide facilitates the 

1 0 intracellular transport of a storage protein. 

Sf 

30. The polypeptide of claim 27, wherein said polypeptide facilitates the 

^ formation of protein bodies. 

q 31. The polypeptide of claim 27, wherein said polypeptide facilitates the 

wi 1 5 formation of oil bodies. 

32. A method of producing an SSE polypeptide, said method comprising the 

steps of: 

(a) providing a cell transformed with a nucleic acid molecule of claim 1 or 8 
positioned for expression in the cell; 
20 (b) culturing the transformed cell under conditions for expressing the nucleic 

acid molecule; and 
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(c) recovering the SSE polypeptide. 

33 . A recombinant SSE polypeptide produced according to the method of 

claim 32. 

34. A substantially pure antibody that specifically recognizes and binds to an 
5 SSE polypeptide or a portion thereof 

35. The antibody of claim 34, wherein said antibody recognizes and binds to 
a recombinant SSE polypeptide or a portion thereof. 

36. A method of isolating an SSE gene or fragment thereof, said method 
comprising the steps of: 

5 (a) contacting the nucleic acid molecule of Fig. 2A (SEQ ID NO:l) or a 

portion thereof with a nucleic acid preparation from a plant cell under hybridization 
conditions providing detection of nucleic acid sequences having at least 30% or greater 
sequence identity to the nucleic acid sequence of Fig. 2A (SEQ ID NO:l); and 
(b) isolating said hybridizing nucleic acid sequences. 

10 37. A method of isolating an SSE gene or fragment thereof, said method 

comprising the steps of: 

(a) providing a sample of plant cell DNA; 

(b) providing a pair of oligonucleotides having sequence identity to a region 
of the nucleic acid of Fig. 2A (SEQ ID NO:l); 

1 5 (c) contacting the pair of oligonucleotides with said plant cell DNA under 

conditions suitable for polymerase chain reaction-mediated DNA amplification; and 
(d) isolating the amplified SSE gene or fragment thereof 
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38. The method of claim 37, wherein said amplification step is carried out 
using a sample of cDNA prepared from a plant cell. 



39. The method of claim 37, wherein said pair of oligonucleotides are based 
on a sequence encoding an SSE polypeptide, wherein the SSE polypeptide is at least 30% 
identical to the amino acid sequence of Fig. 2B (SEQ ID NO:2). 
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MODIFICATION OF PLANT STORAGE RESERVES 



Abstract of Disclosure 

Disclosed is a novel gene that is responsible for protein and oil body 
biogenesis. Methods are also provided for producing plants exhibiting one or more 
desired phenotypic traits relating to storage reserve material. 
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i believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and 
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specification of which 
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□ was filed on as Application Serial No. 
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PROVISIONAL PRIORITY RIGHTS: I hereby claim priority benefits under Title 35, United States Code, 
§1 19(e) and §120 of any United States provisional patent application(s) listed below filed by an inventor or 
inventors on the same subject matter as the present application and having a filing date before that of the 
application (s) of which priority is claimed: 



Serial Number 


Filing Date 


Status 


60/128,651 


April 8, 1999 


Pending 



Page 1 of 3 



COMBINED DECLARATION AND POWER OF ATTORNEY 



NON-PROVISIONAL PRIORITY RIGHTS: I hereby claim the benefit under Title 35, United States Code, 
§120 of any United States application(s) listed below and, insofar as the subject matter of each of the 
claims of this application is not disclosed in the prior United States application in the manner provided by 
the first paragraph of Title 35, United States Code, §1 12, 1 acknowledge the duty to disclose all information 
I know to be material to patentability as defined in Title 37, Code of Federal Regulations, §1 .56(a) which 
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made on information and belief are believed to be true; and further that these statements were made with 
the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, 
or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements 
may jeopardize the validity of the application or any patents issued thereon. 



Full Name 

(First, Middle, Last) 


Residence Address 
(City, State, Country) 


Post Office Address 
(Street, City, State, Country) 


Citizenship 


Yun Lin 


Somerville, MA 






Signature: 


Date: 



Page 2 of 3 



COMBINED DECLARATION AND POWER OF ATTORNEY 



Full Name 

(First, Middle, Last) 


Residence Address 
(City, State, Country) 


Post Office Address 
(Street, City, State, Country) 


Citizenship 


Lin Sun 


Roxbury, MA 






Signature: 


Date: 



Full Name 

(First, Middle, Last) 


Residence Address 
(City, State, Country) 


Post Office Address 
(Street, City, State, Country) 


Citizenship 


Long V. Nguyen 


Vacaville, CA 






Signature: 


Date: 





Full Name 


Residence Address 


Post Office Address 


Citizenship 




(First, Middle, Last) 


(City, State, Country) 


(Street, City, State, Country) 




Howard M. Goodman 


Newton, MA 








Signature: 


Date: 



Page 3 of 3 



PATENT 

ATTORNEY DOCKET NUMBER: 00786/368002 



Certificate of Mailing: Date of Deposit: April 7. 2000 

I hereby certify under 37 CFR 1.10 that this correspondence is being deposited with the United States Postal 
Service as EXPRESS MAIL with sufficient postage on the date indicated above and is addressed to: Box Patent 
Application, Assistant Commissioner of Patents, Washington, D.C. %022 

Marcellus Green 




Printed name of person mailing correspondence Signature of person mailing correspondence 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: Yun Lin et al. Art Unit: 

Serial No.: Examiner: 

Filed: April 7, 2000 

Title: MODIFICATION OF PLANT STORAGE RESERVES 

Assistant Commissioner of Patents 
Washington, D.C. 20231 

STATEMENT UNDER 37 CFR §1 .821 
As part of the patent application filed herewith, enclosed is a sequence listing in 
accordance with the requirements of 37 CFR §§1.821 through 1.825 and consisting of 5 
pages. 

As required by 37 CFR 1.821(c), the sequence listing appears as a separate part of 
the application and is found after the Combined Declaration and Power of Attorney. 
Each sequence in the application appears separately in the sequence listing. And each 
sequence in the sequence listing is assigned a separate sequence identifier. 



As required by 37 CFR § 1 .821(d), the sequence identifiers are used throughout the 
application description and claims to refer to their respective sequences. 

As required by 37 CFR 1.821(e), enclosed is a diskette containing a copy of the 
sequence listing in computer readable form. 

As required by 37 CFR §1.821(f), I hereby state that the contents of the computer 
readable form are the same as the contents of the paper copy. 

As required by 37 CFR §1.821(g), I hereby state that this submission contains no 
new matter. 

If there are any charges, or any credits, please apply them to Deposit Account No. 
03-2095. 



Respectfully submitted, 



Clark & Elbing LLP 
176 Federal Street 
Boston, MA 021 10 



Date: 6Ww£ 7-j loOO 




Telephone: 617-428-0200 
Facsimile: 617-428-7045 



\\NTSERVER\documents\00786\368xxx\00786.368002 Sequence Statement 1.821 (first listing).wpd 



SEQUENCE LISTING 



<110> Yu Lin 
Lin Sun 

Long V. Nguyen 
Howard M. Goodman 

<12 0> MODIFICATION OF PLANT STORAGE RESERVES 



<130> 00786/368002 

<150> 60/128,651 
<151> 1999-04-08 

<160> 9 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1483 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 1 

attgcaacca ggaagagaaa gaaaatcaga gattgattta acgtgaatgg aattttgttg 60 

tttcccaaat tcttctgaga aatagcaaag ttcagttttg tttctctcta tctgaagctc 120 

aatggaagct tataagcaat gggtttggag aaatagagag tatgtacaat cctttggatc 180 

ctttgccaac ggattgacat ggctgcttcc tgagaagttt tctgcttcag agattggacc 240 

agaagcagta acggcttttt tgggcatatt cacaacgata aatgaacaca taattgaaaa 3 00 

tgctccaaca cctcgtggcc atgttggatc ttccgggaat gatccatccc tttcttatcc 360 

actactcatc gccatcctca aggatttgga aactgttgtg gaagtggcag ' ctgaacactt 42 0 

ctatggagac aaaaaatgga actacattat tctcactgaa gctatgaagg ctgtcattag 480 

gttagccttg ttccggaata gtgggtataa gatgcttctt caaggagggg aaacacctaa 54 0 

tgaggagaaa gattctaacc aatccgagtc gcaaaataga gctggtaatt cgggtagaaa 60 0 

tctcgggcct catggtcttg gaaaccaaaa tcatcataat ccatggaact tggaaggacg 660 

ggcgatgtct gctttaagtt catttggtca gaatgcaaga acaacaacat cttctacccc 72 0 

cggttggtct cgaagaattc aacatcagca agcagttata gagcctccaa tgatcaagga 780 

gaggcgaaga acgatgtccg agctacttac tgagaagggt gttaatggag cgttgtttgc 840 

gattggtgag gttctttaca taacgagacc gctcatttac gttcttttca tcagaaaata 900 

tggagtccga tcttggattc cttgggctat atcgctttct gtggacacac tggggatggg 960 

tcttcttgca aattcgaagt ggtggggaga gaagagcaag caagtccatt tctcaggacc 1020 

tgaaaaggat gagctgagga gacgaaaact gatatgggca ttgtacctca tgagagatcc 1080 

attcttcacc aagtacacaa ggcagaagct ggaaagctct caaaagaagc tggaactaat 114 0 

tccattgatc ggattcctca cagagaagat tgtggagctt ttggagggag ctcagtcacg 1200 

gtacacttac atatcgggat cgtgaggtta agcgttttac ttatggttta tatgcaacgg 12 60 

aagaatattg ccattgttgg aatgcttttt tagatcatca aaggctccta cagatttctt 132 0 

agggaatggt ttcaggcttt tgttagaaat tgtgtttatt gcaacaggta gagaacataa 13 80 

ccatagacag atgtatctga agagataagc ttctctatgt ctaaagaaat ggaccgatac 144 0 

gaataaaaca agcatcatta aagattaaaa aaaaaaaaaa aaa 1483 



l 



<210> 2 
<211> 367 
<212> PRT 

<213> Arafoidopsis thaliana 



<400> 2 

Met Glu Ala Tyr Lys Gin Trp Val Trp Arg Asn Arg Glu Tyr Val Gin 

15 10 15 

Ser Phe Gly Ser Phe Ala Asn Gly Leu Thr Trp Leu Leu Pro Glu Lys 

20 25 30 

Phe Ser Ala Ser Glu lie Gly Pro Glu Ala Val Thr Ala Phe Leu Gly 

35 40 45 

lie Phe Thr Thr lie Asn Glu His lie lie Glu Asn Ala Pro Thr Pro 

50 55 60 

Arg Gly His Val Gly Ser Ser Gly Asn Asp Pro Ser Leu Ser Tyr Pro 
65 70 75 80 

Leu Leu lie Ala lie Leu Lys Asp Leu Glu Thr Val Val Glu Val Ala 

85 90 95 

Ala Glu His Phe Tyr Gly Asp Lys Lys Trp Asn Tyr lie lie Leu Thr 

100 105 110 

Glu Ala Met Lys Ala Val lie Arg Leu Ala Leu Phe Arg Asn Ser Gly 

115 120 125 

Tyr Lys Met Leu Leu Gin Gly Gly Glu Thr Pro Asn Glu Glu Lys Asp 

130 135 140 

Ser Asn Gin Ser Glu Ser Gin Asn Arg Ala Gly Asn Ser Gly Arg Asn 
145 150 155 160 

Leu Gly Pro His Gly Leu Gly Asn Gin Asn His His Asn Pro Trp Asn 

165 170 175 

Leu Glu Gly Arg Ala Met Ser Ala Leu Ser Ser Phe Gly Gin Asn Ala 

180 185 190 

Arg Thr Thr Thr Ser Ser Thr Pro Gly Trp Ser Arg Arg lie Gin His 

195 200 205 

Gin Gin Ala Val lie Glu Pro Pro Met lie Lys Glu Arg Arg Arg Thr 

210 215 220 

Met Ser Glu Leu Leu Thr Glu Lys Gly Val Asn Gly Ala Leu Phe Ala 
225 230 235 240 

lie Gly Glu Val Leu Tyr lie Thr Arg Pro Leu lie Tyr Val Leu Phe 

245 250 255 

lie Arg Lys Tyr Gly Val Arg Ser Trp lie Pro Trp Ala lie Ser Leu 

260 265 270 

Ser Val Asp Thr Leu Gly Met Gly Leu Leu Ala Asn Ser Lys Trp Trp 

275 280 285 

Gly Glu Lys Ser Lys Gin Val His Phe Ser Gly Pro Glu Lys Asp Glu 

290 295 300 

Leu Arg Arg Arg Lys Leu lie Trp Ala Leu Tyr Leu Met Arg Asp Pro 
305 310 315 320 

Phe Phe Thr Lys Tyr Thr Arg Gin Lys Leu Glu Ser Ser Gin Lys Lys 

325 330 335 

Leu Glu Leu lie Pro Leu lie Gly Phe Leu Thr Glu Lys lie Val Glu 

340 345 350 

Leu Leu Glu Gly Ala Gin Ser Arg Tyr Thr Tyr lie Ser Gly Ser 



2 



355 360 365 

<210> 3 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 3 

atcagagatt gatttaacgt a 21 

<210> 4 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 4 

acgattttca attatgtgtt c 21 

<210> 5 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 5 

cgcttggtcg gtcatttcg 19 

<210> 6 
<211> 391 
<212> PRT 

<213> Yarrowia lipolytica 
<400> 6 

Met Thr Asp Lys Leu Val Lys Val Met Gin Lys Lys Lys Ser Ala Pro 

1 5 10 15 

Gin Thr Trp Leu Asp Ser Tyr Asp Lys Phe Leu Val Arg Asn Ala Ala 

20 25 30 

Ser He Gly Ser He Glu Ser Thr Leu Arg Thr Val Ser Tyr Val Leu 

35 40 45 

Pro Gly Arg Phe Asn Asp Val Glu He Ala Thr Glu Thr Leu Tyr Ala 

50 55 60 

Val Leu Asn Val Leu Gly Leu Tyr His Asp Thr He He Ala Arg Ala 
65 70 75 80 



3 



Val Ala Ala Ser 

Asn Arg Tyr Thr 
100 

Ala Ser Arg Ala 
115 

Met Val Ala Lys 
130 

He Gly He Glu 
145 

Ser Thr Leu Tyr 

Val Thr Gly Glu 
180 

Asp He Glu Lys 
195 

Gly Arg Thr He 
210 

Leu Thr Lys Val 

225 

Leu Ser Arg Leu 

Leu Arg Pro Leu 
260 

Lys Thr Val Pro 
275 

Pro Trp Ala Pro 
290 

Lys Met Met Gly 
305 

Pro Thr Ala Leu 

Gly Trp Trp Leu 
340 

Leu Leu Tyr Ser 
355 

Leu Phe Gly Ala 
370 

Tyr Phe Thr Ala 
385 

<210> 7 

<211> 19 

<212> DMA 

<213> Artificial 

<220> 

<223> Primer 



Pro Asn Ala Ala 
85 

Asp Trp Phe He 

Val Thr Phe Val 
120 

Lys Asn Gly Gly 
135 

Gly He Lys Ala 
150 

Gin Pro Leu Cys 
165 

Leu Leu Glu Thr 

Gly Leu Met Asp 
200 

Pro Glu He Ala 
215 

Leu Arg Ser Glu 
230 

Asp Asn Trp Gly 
245 

He Tyr Ala Cys 

Ala Ser Thr Lys 
280 

Trp He He Gly 
295 

Ser Trp Leu Leu 
310 

Asp Gin Met Glu 
325 

Phe Arg Gly Glu 

He Val Ala Arg 
360 

Leu He Ser Asp 
375 

Ser Thr Leu 
390 



Sequence 



Ala Val Tyr Arg 
90 

Lys Asn Arg Lys 
105 

Lys Phe Gly Glu 

Glu Met Ala Arg 
140 

Gly Leu Arg He 
155 

Thr Thr Pro Tyr 
170 

He Cys Arg Asp 
185 

Pro Gin Trp Lys 

Pro Thr Asn Val 
220 

Asp Val Asp Arg 
235 

Val Val Ala Glu 
250 

Leu Leu Phe Arg 
265 

Ser Lys Phe Pro 

Leu Val lie Glu 
300 

Arg Gin Arg Gin 
315 

Val Lys Gly Arg 
330 

Phe Tyr Gin Ala 
345 

Leu Glu Lys He 

Tyr Leu Tyr Leu 
380 



Pro Ser Pro His 
95 

Gly Tyr Lys Tyr 
110 

Leu Val Ala Glu 
125 

Trp Lys Cys He 

Tyr Met Leu Gly 
160 

Pro Asp Arg Glu 
175 

Glu Gly Glu Leu 
190 

Met Pro Arg Thr 
205 

Glu Gly Tyr Leu 

Pro Tyr Asn Leu 
240 

Leu Leu Ser He 
255 

Gin His Val Asn 
270 

Phe Leu Asn Ser 
285 

Ala Leu Ser Arg 

Ser Gly Lys Thr 
320 

Thr Asn Leu Leu 
335 

Tyr Thr Arg Pro 
350 

Pro Gly Leu Gly 
365 

Phe Asp Arg Tyr 



<400> 7 

ggcaatattc ttccgttgc 



<210> 8 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<400> 8 

aaaaatggaa ctacattatt etc 

<210> 9 
<211> 22 
<212> DMA 

<213> Artificial Sequence 



<220> 

<223> Primer 



□ 
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<221> variation 
<222> (1) ... (22) 

<223> Where h is a, c, or t/u; not g. 
<400> 9 

ataagtaaaa cgcttaacct he 22 



5 



□ 
m 

:f 

Q 
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m 
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FIGURES 1A-1F 



1 attgcaacca ggaagagaaa 
61 tttcccaaat tcttctgaga 
121 aatggaagct tataagcaat 
181 ctttgccaac ggattgacat 
241 agaagcagta acggcttttt 
3 01 tgctccaaca cctcgtggcc 
361 actactcatc gccatcctca 
421 ctatggagac aaaaaatgga 
481 gttagccttg ttccggaata 
541 tgaggagaaa gattctaacc 
601 tctcgggcct catggtcttg 
661 ggcgatgtct gctttaagtt 
721 cggttggtct cgaagaattc 
7 81 gaggcgaaga acgatgtccg 
841 gattggtgag gttctttaca 
901 tggagtccga tcttggattc 
961 tcttcttgca aattcgaagt 
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1081 attcttcacc aagtacacaa 
1141 tccattgatc ggattcctca 
1201 gtacacttac atatcgggat 
1261 aagaatattg ccattgttgg 
1321 agggaatggt ttcaggcttt 
13 81 ccatagacag atgtatctga 
1441 gaataaaaca agcatcatta 



gaaaatcaga gattgattta 
aatagcaaag ttcagttttg 
gggtttggag aaatagagag 
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tgggcatatt cacaacgata 
atgttggatc ttccgggaat 
aggatttgga aactgttgtg 
actacattat tctcactgaa 
9tgggtataa gatgcttctt 
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aacatcagca agcagttata 
agctacttac tgagaagggt 
taacgagacc gctcatttac 
cttgggctat atcgctttct 
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gacgaaaact gatatgggca 
ggcagaagct ggaaagctct 
cagagaagat tgtggagctt 
cgtgaggtta agcgttttac 
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agagataagc ttctctatgt 
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acgtgaatgg aattttgttg 
tttctctcta tctgaagctc 
tatgtacaat cctttggatc 
tctgcttcag agattggacc 
aatgaacaca taattgaaaa 
gatccatccc tttcttatcc 
gaagtggcag ctgaacactt 
gctatgaagg ctgtcattag 
caaggagggg aaacacctaa 
gctggtaatt cgggtagaaa 
ccatggaact tggaaggacg 
acaacaacat cttctacccc 
gagcctccaa tgatcaagga 
gttaatggag cgttgtttgc 
gttcttttca tcagaaaata 
gtggacacac tggggatggg 
caagtccatt tctcaggacc 
ttgtacctca tgagagatcc 
caaaagaagc tggaactaat 
ttggagggag ctcagtcacg 
ttatggttta tatgcaacgg 
aaggctccta cagatttctt 
gcaacaggta gagaacataa 
ctaaagaaat ggaccgatac 
aaa 
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FIGURE 3C 
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